Cascade/dCas3 complementation assays for in vivo detection of nucleic acid-guided nuclease edited cells

ABSTRACT

The present disclosure relates to methods and compositions that allow one to identify in vivo edited cells when employing nucleic-acid guided editing. Additionally provided are automated multi-module instruments for performing editing and selection methods and using the compositions.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. Ser. No. 17/123,067, filedDec. 15, 2020, now allowed; which claims priority to U.S. Ser. No.62/949,472, filed Dec. 18, 2019, both of which are incorporated byreference.

FIELD OF THE INVENTION

The present disclosure relates to methods and compositions to allow forin vivo identification of specific nucleic-acid sequences, such asintended edit sequences present in cells when employing nucleic-acidguided editing, as well as automated multi-module instruments forperforming the editing and selection methods.

BACKGROUND OF THE INVENTION

In the following discussion certain articles and methods will bedescribed for background and introductory purposes. Nothing containedherein is to be construed as an “admission” of prior art. Applicantexpressly reserves the right to demonstrate, where appropriate, that thearticles and methods referenced herein do not constitute prior art underthe applicable statutory provisions.

The ability to make precise, targeted changes to the genome of livingcells has been a long-standing goal in biomedical research anddevelopment. Recently, various nucleases have been identified that allowfor manipulation of gene sequences, and hence gene function. Thenucleases include nucleic acid-guided nucleases, which enableresearchers to generate permanent edits in live cells. Of course, it isdesirable to be able to identify cells that have been properly edited ina resulting cell population; however, in many instances the percentageof edited cells resulting from nucleic acid-guided nuclease editing canbe in the single digits.

There is thus a need in the art of nucleic acid-guided nuclease editingfor improved methods, compositions, modules and instruments for rapidand accurate identification in vivo of cells that have been properlyedited. The present disclosure addresses this need.

SUMMARY OF THE INVENTION

This Summary is provided to introduce a selection of concepts in asimplified form that are further described below in the DetailedDescription. This Summary is not intended to identify key or essentialfeatures of the claimed subject matter, nor is it intended to be used tolimit the scope of the claimed subject matter. Other features, details,utilities, and advantages of the claimed subject matter will be apparentfrom the following written Detailed Description including those aspectsillustrated in the accompanying drawings and defined in the appendedclaims.

The present disclosure relates to methods, compositions, modules andautomated multi-module cell processing instruments that allow one togenerate nucleic acid-guided nuclease edited cells and to identify invivo the cells that have been properly edited in the resultingpopulation of cells where the majority—and perhaps the vast majority—ofcells have not been edited. The present methods and compositions employa split protein reporter system that uses a type I CRISPR-Cas system.The split protein reporter system exploits the natural mechanism of Cas3(CAS3) recruitment upon Cascade complex target recognition. Therecruitment of Cas3 to the Cascade complex initiates an intracellularsignal amplification event specific to the high fidelity targeting ofthe Cascade complex to a specified DNA sequence, in this case, a DNAsequence comprising a desired edit. By attaching one-half of a splitprotein to a deactivated Cas3 (dCas3) and the other half of the splitprotein to a protein component of the Cascade, a system is created inwhich the two halves of the split protein only come together when theCascade complex (e.g., Cascade and crRNA) has formed a discriminatoryR-loop and complexed with the correct target DNA sequence. In oneembodiment, the split protein is T7 RNA polymerase (T7 RNAP). Uponrecognition of the Cascade complex fusion and the target sequence (e.g.,an intended edit) and upon recruitment of the deactivatedCas3-N-terminal T7 RNAP fusion protein, the two halves of the split T7RNAP are brought into proximity resulting in an active T7 polymerase.The active T7 polymerase is capable of transcribing, e.g., a codingsequence for a reporter gene under the control of a T7 promoter, whichin turn allows for isolation of a population of cells with intendededits.

Thus, there is provided in one embodiment herein a nucleic acid-guidednuclease editing system comprising: a Cascade-T7-RNAP fusion proteincoding sequence in a vector backbone; a dCas3-T7-RNAP fusion proteincoding sequence in a vector backbone; a sequence for anedit-discriminating (or “edit-targeting”) gRNA in a vector backbone,wherein the edit-discriminating (or “edit-targeting”) gRNA recognizes arationally-designed edited locus in a genome sequence but does notrecognize the locus in the genome sequence in an unedited orincorrectly-edited condition; and a coding sequence for a reporter geneunder the control of a T7 promoter. In some aspects, Cascade-T7 RNAPfusion protein coding sequence comprises the C-terminus of cas5c (intype I-C systems) or the C-terminus of cse1 or casA (in type I-Esystems) and the C-terminus of the T7 RNAP (e.g., amino acids 181-883)while the dCas3-T7 RNAP fusion protein coding sequence comprises theN-terminus of dCas3 and the N-terminus of the T7-RNAP (e.g., amino acids1-179). Alternatively in some aspects, the Cascade-T7 RNAP fusionprotein coding sequence comprises the N-terminus of cas5c (in type I-Csystems) or the N-terminus of cse1 or casA (in type I-E systems) and theN-terminus of the T7 RNAP (e.g., amino acids 1-179) while the dCas3-T7RNAP fusion protein coding sequence comprises the C-terminus of dCas3and the C-terminus of the T7-RNAP (e.g., amino acids 181-883). In someaspects, the reporter gene is a coding sequence for a fluorescentprotein, and some aspects, the fluorescent protein is green fluorescentprotein or blue fluorescent protein. In yet other aspects, the reportergene is a coding sequence for luciferase, and in some aspects, theluciferase is firefly luciferase or Renilla luciferase. In yet otheraspects, the reporter gene encodes for a broccoli or spinach RNAaptamer. In yet other aspects, the reporter gene is a coding sequencefor an antibiotic resistance gene. In yet other aspects, the reportergene is a coding sequence for a cell surface receptor protein.

In some aspects of the nucleic acid-guided nuclease editing system, theCascade-T7-RNAP-C-terminal fusion protein coding sequence, thedCas3-T7-RNAP-N-terminal fusion protein coding sequence, the sequencefor the edit-discriminating gRNA, and coding sequence for a reportergene under the control of a T7 promoter are all on the same vector andin alternative aspects, the Cascade-T7-RNAP-C-terminal fusion proteincoding sequence, the dCas3-T7-RNAP-N-terminal fusion protein codingsequence, the sequence for the edit-discriminating gRNA, and codingsequence for a reporter gene under the control of a T7 promoter are ontwo or more different vectors.

Yet another embodiment provides a cell comprising theCascade-T7-RNAP-C-terminal fusion protein coding sequence in a vectorbackbone; the dCas3-T7-RNAP-N-terminal fusion protein coding sequence ina vector backbone; the sequence for an edit-discriminating gRNA in avector backbone, wherein the edit-discriminating gRNA recognizes arationally-designed edited locus in a genome sequence but does notrecognize the locus in the genome sequence in an unedited orincorrectly-edited condition; and the coding sequence for a reportergene under the control of a T7 promoter.

Yet other embodiments provide a method for in vivo identification ofedited cells comprising: transforming the cells with nucleic acid-guidednuclease editing components comprising a nucleic acid-guided nuclease, agRNA homologous to a genomic locus, and a donor DNA homologous to agenomic locus; transforming the cells with a nucleic acid-guidednuclease editing system comprising a Cascade-T7-RNAP-C-terminal fusionprotein coding sequence in a vector backbone, a dCas3-T7-RNAP-N-terminalfusion protein coding sequence in a vector backbone, a sequence for anedit-discriminating gRNA in a vector backbone, wherein theedit-discriminating gRNA recognizes a rationally-designed edited locusin a genome sequence but does not recognize the locus in the genomesequence in an unedited or incorrectly-edited condition, and a codingsequence for a reporter gene under the control of a T7 promoter;allowing the nucleic acid-guided nuclease, the gRNA, and donor DNA toedit the genomic locus in the cells; allowing theCascade-T7-RNAP-C-terminal fusion protein coding sequence,edit-discriminating gRNA and Cascade-T7-RNAP-C-terminal fusion proteincoding sequence bind to the edited genomic locus thereby reconstitutingT7 RNAP activity; and providing conditions for T7 RNAP to bind andactivate the T7 promoter thereby transcribing the reporter gene.

These aspects and other features and advantages of the invention aredescribed below in more detail.

BRIEF DESCRIPTION OF THE DRAWINGS

The foregoing and other features and advantages of the present inventionwill be more fully understood from the following detailed description ofillustrative embodiments taken in conjunction with the accompanyingdrawings in which:

FIG. 1A is a simple process diagram for performing Type I nucleicacid-guided nuclease selection using a Cascade-T7-RNAP-C-terminal fusionprotein, a dCas3-T7-RNAP-N-terminal fusion protein, anedit-discriminating (or “edit-targeting”) gRNA and a reporter gene(collectively, a “split protein reporter system”). FIG. 1B is asimplified schematic of the components of a split protein system where aCascade-T7-RNAP-C-terminal fusion protein in complex with anedit-discriminating (or edit-targeting) gRNA does not bind to a wildtype (e.g., unedited) genomic target sequence. FIG. 1C is a simplifiedschematic of the components of a split protein system where there is anedited genomic target sequence, but the edit is not a desired, intendededit. Thus, an edit-discriminating gRNA transcript in complex withCascade-T7-RNAP-C-terminal fusion does not recognize and bind to thegenomic target sequence, form a Cascade complex and recruit thedCas3-T7-RNAP-N-terminal fusion protein to the incorrectly-edited targetgenomic sequence. FIG. 1D is a simplified schematic of the components ofa split protein system where there is a properly-edited genomic targetsequence, an edit-discriminating gRNA transcript in complex with aCascade-T7-RNAP-C-terminal fusion that binds the properly-edited genomictarget sequence, forms an active R-loop and thereby recruits thedCas3-T7-RNAP-N-terminal fusion protein to the edited genomic sequence.The recruitment of the dCas3-T7-RNAP-N-terminal fusion protein bringsthe N-terminal and C-terminal portions of the T7 RNAP protein intofunctional proximity.

FIGS. 2A-2C depict three different views of an exemplary automatedmulti-module cell processing instrument for performing nucleicacid-guided nuclease editing employing a split protein reporter system.

FIG. 3A depicts one embodiment of a rotating growth vial for use withthe cell growth module described herein and in relation to FIGS. 3B-3D.FIG. 3B illustrates a perspective view of one embodiment of a rotatinggrowth vial in a cell growth module housing. FIG. 3C depicts a cut-awayview of the cell growth module from FIG. 3B. FIG. 3D illustrates thecell growth module of FIG. 3B coupled to LED, detector, and temperatureregulating components.

FIG. 4A depicts retentate (top) and permeate (bottom) members for use ina tangential flow filtration module (e.g., cell growth and/orconcentration module), as well as the retentate and permeate membersassembled into a tangential flow assembly (bottom).

FIG. 4B depicts two side perspective views of a reservoir assembly of atangential flow filtration module. FIGS. 4C-4E depict an exemplary top,with fluidic and pneumatic ports and gasket suitable for the reservoirassemblies shown in FIG. 4B.

FIG. 5A depicts an exemplary combination reagent cartridge andelectroporation device (e.g., transformation module) that may be used ina multi-module cell processing instrument. FIG. 5B is a top perspectiveview of one embodiment of an exemplary flow-through electroporationdevice that may be part of a reagent cartridge.

FIG. 5C depicts a bottom perspective view of one embodiment of anexemplary flow-through electroporation device that may be part of areagent cartridge. FIGS. 5D-5F depict a top perspective view, a top viewof a cross section, and a side perspective view of a cross section of anFTEP device useful in a multi-module automated cell processinginstrument such as that shown in FIGS. 2A-2C.

FIG. 6A depicts a simplified graphic of a workflow for singulating,editing and normalizing cells in a solid wall device. 6B depicts asimplified graphic of a workflow variation for substantiallysingulating, editing and normalizing cells in a solid wall device.

FIGS. 6C-6E depict an embodiment of a solid wall isolation incubationand normalization (SWIIN) module. FIG. 6F depicts the embodiment of theSWIIN module in FIGS. 6C-6E further comprising a heater and a heatedcover.

FIG. 7 is a simplified process diagram of an embodiment of an exemplaryautomated multi-module cell processing instrument in which the splitprotein reporter system described herein may be used.

FIG. 8A-8D comprise exemplary vector maps for testing the split proteinreporter system in E. coli.

It should be understood that the drawings are not necessarily to scale,and that like reference numbers refer to like features.

DETAILED DESCRIPTION

All of the functionalities described in connection with one embodimentof the methods, devices or instruments described herein are intended tobe applicable to the additional embodiments of the methods, devices andinstruments described herein except where expressly stated or where thefeature or function is incompatible with the additional embodiments. Forexample, where a given feature or function is expressly described inconnection with one embodiment but not expressly mentioned in connectionwith an alternative embodiment, it should be understood that the featureor function may be deployed, utilized, or implemented in connection withthe alternative embodiment unless the feature or function isincompatible with the alternative embodiment.

The practice of the techniques described herein may employ, unlessotherwise indicated, conventional techniques and descriptions ofmolecular biology (including recombinant techniques), cell biology,biochemistry, and genetic engineering technology, which are within theskill of those who practice in the art. Such conventional techniques anddescriptions can be found in standard laboratory manuals such as Greenand Sambrook, Molecular Cloning: A Laboratory Manual. 4th, ed., ColdSpring Harbor Laboratory Press, Cold Spring Harbor, N.Y., (2014);Current Protocols in Molecular Biology, Ausubel, et al. eds., (2017);Neumann, et al., Electroporation and Electrofusion in Cell Biology,Plenum Press, New York, 1989; and Chang, et al., Guide toElectroporation and Electrofusion, Academic Press, California (1992),all of which are herein incorporated in their entirety by reference forall purposes. Nucleic acid-guided nuclease techniques can be found in,e.g., Genome Editing and Engineering from TALENs and CRISPRs toMolecular Surgery, Appasani and Church (2018); and CRISPR: Methods andProtocols, Lindgren and Charpentier (2015); both of which are hereinincorporated in their entirety by reference for all purposes.

Note that as used herein and in the appended claims, the singular forms“a,” “an,” and “the” include plural referents unless the context clearlydictates otherwise. Thus, for example, reference to “a cell” refers toone or more cells, and reference to “the system” includes reference toequivalent steps, methods and devices known to those skilled in the art,and so forth. Additionally, it is to be understood that terms such as“left,” “right,” “top,” “bottom,” “front,” “rear,” “side,” “height,”“length,” “width,” “upper,” “lower,” “interior,” “exterior,” “inner,”“outer” that may be used herein merely describe points of reference anddo not necessarily limit embodiments of the present disclosure to anyparticular orientation or configuration. Furthermore, terms such as“first,” “second,” “third,” etc., merely identify one of a number ofportions, components, steps, operations, functions, and/or points ofreference as disclosed herein, and likewise do not necessarily limitembodiments of the present disclosure to any particular configuration ororientation.

Unless defined otherwise, all technical and scientific terms used hereinhave the same meaning as commonly understood by one of ordinary skill inthe art to which this invention belongs. All publications mentionedherein are incorporated by reference for the purpose of describing anddisclosing devices, formulations and methodologies that may be used inconnection with the presently described invention.

Where a range of values is provided, it is understood that eachintervening value, between the upper and lower limit of that range andany other stated or intervening value in that stated range isencompassed within the invention. The upper and lower limits of thesesmaller ranges may independently be included in smaller ranges, and arealso encompassed within the invention, subject to any specificallyexcluded limit in the stated range. Where the stated range includes oneor both of the limits, ranges excluding either or both of those includedlimits are also included in the invention.

In the following description, numerous specific details are set forth toprovide a more thorough understanding of the present invention. However,it will be apparent to one of skill in the art that the presentinvention may be practiced without one or more of these specificdetails. In other instances, features and procedures well known to thoseskilled in the art have not been described in order to avoid obscuringthe invention. The terms used herein are intended to have the plain andordinary meaning as understood by those of ordinary skill in the art.

The term “complementary” as used herein refers to Watson-Crick basepairing between nucleotides and specifically refers to nucleotideshydrogen-bonded to one another with thymine or uracil residues linked toadenine residues by two hydrogen bonds and cytosine and guanine residueslinked by three hydrogen bonds. In general, a nucleic acid includes anucleotide sequence described as having a “percent complementarity” or“percent homology” to a specified second nucleotide sequence. Forexample, a nucleotide sequence may have 80%, 90%, or 100%complementarity to a specified second nucleotide sequence, indicatingthat 8 of 10, 9 of 10 or 10 of 10 nucleotides of a sequence arecomplementary to the specified second nucleotide sequence. For instance,the nucleotide sequence 3′-TCGA-5′ is 100% complementary to thenucleotide sequence 5′-AGCT-3′; and the nucleotide sequence 3′-TCGA-5′is 100% complementary to a region of the nucleotide sequence5′-TAGCTG-3′.

The term “Cascade” or “Cascade effector” refers to the protein effectorcomplexes of type I CRISPR-Cas systems. The term “Cascade complex”refers to the Cascade effector complex further comprising a crRNA.

The term DNA “control sequences” refers collectively to promotersequences, polyadenylation signals, transcription termination sequences,upstream regulatory domains, origins of replication, internal ribosomeentry sites, nuclear localization sequences, enhancers, and the like,which collectively provide for the replication, transcription andtranslation of a coding sequence in a recipient cell. Not all of thesetypes of control sequences need to be present so long as a selectedcoding sequence is capable of being replicated, transcribed and—for somecomponents—translated in an appropriate host cell.

As used herein the term “donor DNA” or “donor nucleic acid” refers tonucleic acid that is designed to introduce a DNA sequence modification(insertion, deletion, substitution) into a locus (e.g., a target genomicDNA sequence or cellular target sequence) by homologous recombinationusing nucleic acid-guided nucleases. For homology-directed repair, thedonor DNA must have sufficient homology to the regions flanking the “cutsite” or site to be edited in the genomic target sequence. The length ofthe homology arm(s) will depend on, e.g., the type and size of themodification being made. In many instances and preferably, the donor DNAwill have two regions of sequence homology (e.g., two homology arms) tothe genomic target locus. Preferably, an “insert” region or “DNAsequence modification” region—the nucleic acid modification that onedesires to be introduced into a genome target locus in a cell—will belocated between two regions of homology. The DNA sequence modificationmay change one or more bases of the target genomic DNA sequence at onespecific site or multiple specific sites. A change may include changing1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300,400, or 500 or more base pairs of the genomic target sequence. Adeletion or insertion may be a deletion or insertion of 1, 2, 3, 4, 5,10, 15, 20, 25, 30, 40, 50, 75, 100, 150, 200, 300, 400, or 500 or morebase pairs of the genomic target sequence.

The terms “guide nucleic acid” or “guide RNA” or “gRNA” refer to apolynucleotide comprising 1) a guide sequence capable of hybridizing toa genomic target locus, and 2) a scaffold sequence capable ofinteracting or complexing with a nucleic acid-guided nuclease.

“Homology” or “identity” or “similarity” refers to sequence similaritybetween two peptides or, more often in the context of the presentdisclosure, between two nucleic acid molecules. The term “homologousregion” or “homology arm” refers to a region on the donor DNA with acertain degree of homology with the target genomic DNA sequence.Homology can be determined by comparing a position in each sequencewhich may be aligned for purposes of comparison. When a position in thecompared sequence is occupied by the same base or amino acid, then themolecules are homologous at that position. A degree of homology betweensequences is a function of the number of matching or homologouspositions shared by the sequences.

“Operably linked” refers to an arrangement of elements where thecomponents so described are configured so as to perform their usualfunction. Thus, control sequences operably linked to a coding sequenceare capable of effecting the transcription, and in some cases, thetranslation, of a coding sequence. The control sequences need not becontiguous with the coding sequence so long as they function to directthe expression of the coding sequence. Thus, for example, interveninguntranslated yet transcribed sequences can be present between a promotersequence and the coding sequence and the promoter sequence can still beconsidered “operably linked” to the coding sequence. In fact, suchsequences need not reside on the same contiguous DNA molecule (i.e.chromosome) and may still have interactions resulting in alteredregulation.

As used herein, the terms “protein” and “polypeptide” are usedinterchangeably. Proteins may or may not be made up entirely of aminoacids.

A “promoter” or “promoter sequence” is a DNA regulatory region capableof binding RNA polymerase and initiating transcription of apolynucleotide or polypeptide coding sequence such as messenger RNA,ribosomal RNA, small nuclear or nucleolar RNA, guide RNA, or any kind ofRNA transcribed by any class of any RNA polymerase I, II or III.Promoters may be constitutive or inducible.

As used herein a “reporter gene” is a gene used as an indicator for geneexpression and other cellular events, such as, e.g., genes coding forluciferase or fluorescent proteins. In the present context, a reportergene is used as a proxy for genome editing.

As used herein the term “selectable marker” refers to a gene introducedinto a cell, which confers a trait suitable for artificial selection.General use selectable markers are well-known to those of ordinary skillin the art and include ampicillin/carbenicillin, kanamycin,chloramphenicol, nourseothricin N-acetyl transferase, erythromycin,tetracycline, gentamicin, bleomycin, streptomycin, puromycin,hygromycin, blasticidin, and G418 or other selectable markers may beemployed.

The term “specifically binds” as used herein includes an interactionbetween two molecules, e.g., an engineered peptide antigen and a bindingtarget, with a binding affinity represented by a dissociation constantof about 10⁻⁷M, about 10⁻⁸M, about 10⁻⁹ M, about 10⁻¹⁰ M, about 10⁻¹¹M,about 10⁻¹²M, about 10⁻¹³M, about 10⁻¹⁴M or about 10⁻¹⁵M.

The terms “target genomic DNA sequence”, “cellular target sequence”,“target sequence”, or “genomic target locus” refer to any locus in vitroor in vivo, or in a nucleic acid (e.g., genome or episome) of a cell orpopulation of cells, in which a change of at least one nucleotide isdesired using a nucleic acid-guided nuclease editing system. The targetsequence can be a genomic locus or extrachromosomal locus. The term“edited target sequence” or “edited locus” refers to a target genomicsequence or target sequence after editing has been performed, where theedited target sequence comprises the desired edit.

The term “variant” may refer to a polypeptide or polynucleotide thatdiffers from a reference polypeptide or polynucleotide but retainsessential properties. A typical variant of a polypeptide differs inamino acid sequence from another reference polypeptide. Generally,differences are limited so that the sequences of the referencepolypeptide and the variant are closely similar overall and, in manyregions, identical. A variant and reference polypeptide may differ inamino acid sequence by one or more modifications (e.g., substitutions,additions, and/or deletions). A variant of a polypeptide may be aconservatively modified variant. A substituted or inserted amino acidresidue may or may not be one encoded by the genetic code (e.g., anon-natural amino acid). A variant of a polypeptide may be naturallyoccurring, such as an allelic variant, or it may be a variant that isnot known to occur naturally.

A “vector” is any of a variety of nucleic acids that comprise a desiredsequence or sequences to be delivered to and/or expressed in a cell.Vectors are typically composed of DNA, although RNA vectors are alsoavailable. Vectors include, but are not limited to, plasmids, fosmids,phagemids, virus genomes, synthetic chromosomes, and the like. In someembodiments of the present methods, two vectors—an engine vector,comprising the coding sequences for a nuclease, and an editing vector,comprising the gRNA sequence and the donor DNA sequence—are used. Inalternative embodiments, all editing components, including the nuclease,gRNA sequence, and donor DNA sequence are all on the same vector (e.g.,a combined editing/engine vector). In some embodiments, the codingsequences for the Cascade-T7-RNAP-C-terminal fusion protein, thedCas3-T7-RNAP-N-terminal fusion protein, edit-discriminating gRNA andthe reporter gene under the control of a T7 promoter are all located ona single reporter vector, but in other embodiments, one or more of thesecomponents may be located on the engine vector, the editing vector, orone or more different reporter vectors.

Nuclease-Directed Genome Editing Generally

The compositions and methods described herein are employed to allow oneto perform nuclease-directed genome editing to introduce desired editsto a population of cells and then allow one to quickly identify editedcells in vivo. In some embodiments, recursive cell editing is performedwhere edits are introduced in successive rounds of editing to cells thathave been edited in previous rounds. A nucleic acid-guided nucleasecomplexed with an appropriate synthetic guide nucleic acid in a cell cancut the genome of the cell at a desired location. The guide nucleic acidhelps the nucleic acid-guided nuclease recognize and cut the DNA at aspecific target sequence. By manipulating the nucleotide sequence of theguide nucleic acid, the nucleic acid-guided nuclease may be programmedto target any DNA sequence for cleavage as long as an appropriateprotospacer adjacent motif (PAM) is nearby. In certain aspects, thenucleic acid-guided nuclease editing system may use two separate guidenucleic acid molecules that combine to function as a guide nucleic acid,e.g., a CRISPR RNA (crRNA) and trans-activating CRISPR RNA (tracrRNA).In other aspects and preferably, the guide nucleic acid is a singleguide nucleic acid construct that includes both 1) a guide sequencecapable of hybridizing to a genomic target locus, and 2) a scaffoldsequence capable of interacting or complexing with a nucleic acid-guidednuclease.

In general, a guide nucleic acid (e.g., gRNA) complexes with acompatible nucleic acid-guided nuclease and can then hybridize with atarget sequence, thereby directing the nuclease to the target sequence.A guide nucleic acid can be DNA or RNA; alternatively, a guide nucleicacid may comprise both DNA and RNA. In some embodiments, a guide nucleicacid may comprise modified or non-naturally occurring nucleotides. Incases where the guide nucleic acid comprises RNA, the gRNA may beencoded by a DNA sequence on a polynucleotide molecule such as aplasmid, linear construct, or the coding sequence may and preferablydoes reside within an editing cassette. Methods and compositions fordesigning and synthesizing editing cassettes are described in USPNs10,240,167; 10,266,849; 9,982,278; 10,351,877; 10,364,442; and10,435,715; and U.S. Ser. No. 16/275,465, filed 14 Feb. 2019, all ofwhich are incorporated by reference herein.

A guide nucleic acid comprises a guide sequence, where the guidesequence is a polynucleotide sequence having sufficient complementaritywith a target sequence to hybridize with the target sequence and directsequence-specific binding of a complexed nucleic acid-guided nuclease tothe target sequence. The degree of complementarity between a guidesequence and the corresponding target sequence, when optimally alignedusing a suitable alignment algorithm, is about or more than about 50%,60%, 75%, 80%, 85%, 90%, 95%, 97.5%, 99%, or more. Optimal alignment maybe determined with the use of any suitable algorithm for aligningsequences. In some embodiments, a guide sequence is about or more thanabout 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25,26, 27, 28, 29, 30, 35, 40, 45, 50, 75, or more nucleotides in length.In some embodiments, a guide sequence is less than about 75, 50, 45, 40,35, 30, 25, 20 nucleotides in length. Preferably the guide sequence is10-30 or 15-20 nucleotides long, or 15, 16, 17, 18, 19, or 20nucleotides in length.

In general, to generate an edit in the target sequence, thegRNA/nuclease complex binds to a target sequence as determined by theguide RNA, and the nuclease recognizes a protospacer adjacent motif(PAM) sequence adjacent to the target sequence. The target sequence canbe any polynucleotide endogenous or exogenous to the cell, or in vitro.For example, the target sequence can be a polynucleotide residing in thenucleus of the cell. A target sequence can be a sequence encoding a geneproduct (e.g., a protein) or a non-coding sequence (e.g., a regulatorypolynucleotide, an intron, a PAM, a control sequence, or “junk” DNA).

The guide nucleic acid may be and preferably is part of an editingcassette that encodes the donor nucleic acid that targets a cellulartarget sequence. Alternatively, the guide nucleic acid may not be partof the editing cassette and instead may be encoded on the editing vectorbackbone. For example, a sequence coding for a guide nucleic acid can beassembled or inserted into a vector backbone first, followed byinsertion of the donor nucleic acid in, e.g., an editing cassette. Inother cases, the donor nucleic acid in, e.g., an editing cassette can beinserted or assembled into a vector backbone first, followed byinsertion of the sequence coding for the guide nucleic acid. Preferably,the sequence encoding the guide nucleic acid and the donor nucleic acidare located together in a rationally-designed editing cassette and aresimultaneously inserted or assembled via gap repair into a linearplasmid or vector backbone to create an editing vector.

The target sequence is associated with a proto-spacer mutation (PAM),which is a short nucleotide sequence recognized by the gRNA/nucleasecomplex. The precise preferred PAM sequence and length requirements fordifferent nucleic acid-guided nucleases vary; however, PAMs typicallyare 2-7 base-pair sequences adjacent or in proximity to the targetsequence and, depending on the nuclease, can be 5′ or 3′ to the targetsequence. Engineering of the PAM-interacting domain of a nucleicacid-guided nuclease may allow for alteration of PAM specificity,improve target site recognition fidelity, decrease target siterecognition fidelity, or increase the versatility of a nucleicacid-guided nuclease.

In most embodiments, the genome editing of a cellular target sequenceboth introduces a desired DNA change to a cellular target sequence,e.g., the genomic DNA of a cell, and removes, mutates, or rendersinactive a proto-spacer mutation (PAM) region in the cellular targetsequence (e.g., renders the target site immune to further nucleasebinding). Rendering the PAM at the cellular target sequence inactiveprecludes additional editing of the cell genome at that cellular targetsequence, e.g., upon subsequent exposure to a nucleic acid-guidednuclease complexed with a synthetic guide nucleic acid in later roundsof editing. Thus, cells having the desired cellular target sequence editand an altered PAM can be selected for by using a nucleic acid-guidednuclease complexed with a synthetic guide nucleic acid complementary tothe cellular target sequence. Cells that did not undergo the firstediting event will be cut rendering a double-stranded DNA break, andthus will not continue to be viable. The cells containing the desiredcellular target sequence edit and PAM alteration will not be cut, asthese edited cells no longer contain the necessary PAM site and willcontinue to grow and propagate.

As for the nuclease component of the nucleic acid-guided nucleaseediting system, a polynucleotide sequence encoding the nucleicacid-guided nuclease can be codon optimized for expression in particularcell types, such as bacterial, yeast, and mammalian cells. The choice ofthe nucleic acid-guided nuclease to be employed depends on many factors,such as what type of edit is to be made in the target sequence andwhether an appropriate PAM is located close to the desired targetsequence. Nucleases of use in the methods described herein include butare not limited to Cas 9, Cas 12/CpfI, MAD2, or MAD7 or other MADzymes.

Another component of the nucleic acid-guided nuclease system is thedonor nucleic acid comprising homology to the cellular target sequence.The donor nucleic acid is on the same vector and even in the sameediting cassette as the guide nucleic acid and preferably is (but notnecessarily is) under the control of the same promoter as the editinggRNA (that is, a single promoter driving the transcription of both theediting gRNA and the donor nucleic acid). The donor nucleic acid isdesigned to serve as a template for homologous recombination with acellular target sequence nicked or cleaved by the nucleic acid-guidednuclease as a part of the gRNA/nuclease complex. A donor nucleic acidpolynucleotide may be of any suitable length, such as about or more thanabout 20, 25, 50, 75, 100, 150, 200, 500, or 1000 nucleotides in length,and up to 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13 and up to 20 kb inlength if combined with a dual gRNA architecture as described in U.S.Ser. No. 16/275,465, filed 14 Feb. 2019. In certain preferred aspects,the donor nucleic acid can be provided as an oligonucleotide of between20-300 nucleotides, more preferably between 50-250 nucleotides. Thedonor nucleic acid comprises a region that is complementary to a portionof the cellular target sequence (e.g., a homology arm). When optimallyaligned, the donor nucleic acid overlaps with (is complementary to) thecellular target sequence by, e.g., about 20, 25, 30, 35, 40, 50, 60, 70,80, 90 or more nucleotides. In many embodiments, the donor nucleic acidcomprises two homology arms (regions complementary to the cellulartarget sequence) flanking the mutation or difference between the donornucleic acid and the cellular target sequence. The donor nucleic acidcomprises at least one mutation or alteration compared to the cellulartarget sequence, such as an insertion, deletion, modification, or anycombination thereof compared to the cellular target sequence.

As described in relation to the gRNA, the donor nucleic acid ispreferably provided as part of a rationally-designed editing cassette,which is inserted into an editing plasmid backbone (in yeast, preferablya linear plasmid backbone) where the editing plasmid backbone maycomprise a promoter to drive transcription of the editing gRNA and thedonor DNA when the editing cassette is inserted into the editing plasmidbackbone. Moreover, there may be more than one, e.g., two, three, four,or more editing gRNA/donor nucleic acid rationally-designed editingcassettes inserted into an editing vector; alternatively, a singlerationally-designed editing cassette may comprise two to several editinggRNA/donor DNA pairs, where each editing gRNA is under the control ofseparate different promoters, separate like promoters, or where allgRNAs/donor nucleic acid pairs are under the control of a singlepromoter. In some embodiments the promoter driving transcription of theediting gRNA and the donor nucleic acid (or driving more than oneediting gRNA/donor nucleic acid pair) is optionally an induciblepromoter.

In addition to the donor nucleic acid, an editing cassette may compriseone or more primer sites. The primer sites can be used to amplify theediting cassette by using oligonucleotide primers; for example, if theprimer sites flank one or more of the other components of the editingcassette. In addition, the editing cassette may comprise a barcode. Abarcode is a unique DNA sequence that corresponds to the donor DNAsequence such that the barcode can identify the edit made to thecorresponding cellular target sequence. The barcode typically comprisesfour or more nucleotides. In addition, the editing cassette may comprisea set of FLP/FRT or Cre/Lox recombination sites that enable controlleddeletion of the donor DNA and or gRNA while preserving the barcode. Insome embodiments, the editing cassettes comprise a collection or libraryediting gRNAs and of donor nucleic acids representing, e.g., gene-wideor genome-wide libraries of editing gRNAs and donor nucleic acids. Thelibrary of editing cassettes is cloned into vector backbones where,e.g., each different donor nucleic acid is associated with a differentbarcode. Also, in preferred embodiments, an editing vector or plasmidencoding components of the nucleic acid-guided nuclease system furtherencodes a nucleic acid-guided nuclease comprising one or more nuclearlocalization sequences (NLSs), such as about or more than about 1, 2, 3,4, 5, 6, 7, 8, 9, 10, or more NLSs, particularly as an element of thenuclease sequence. In some embodiments, the engineered nucleasecomprises NLSs at or near the amino-terminus, NLSs at or near thecarboxy-terminus, or a combination.

Increasing Efficiency of Identifying Nuclease-Directed Edited Cells InVivo Via a Split Protein Reporter System

The present disclosure is drawn to increasing the efficiency of in vivodetection of edits made to live cells after nucleic acid-guided nucleaseediting has been performed. Genome editing using nucleic acid-guidednuclease editing technology requires precise repair of nuclease-inducedDNA strand breaks (e.g., double-strand breaks or single-strand nicks)via homologous recombination with an editing vector. Double-strand DNAbreaks in cells caused by nucleic acid-guided nucleases have three mainoutcomes: 1) cell death if the break is not repaired; 2) non-homologousend joining (NHEJ), which repairs the break without a homologous repairtemplate often leading to indels; and 3) homologous recombination (HR),which uses auxiliary (here, exogenous) homologous DNA (e.g., a donor DNAsequence from an editing cassette inserted into the editing vector) torepair the break. The present methods and compositions are drawn to invivo identification of cells that have been edited by HR.

The present methods and compositions utilize, in addition to a nucleicacid-guided nuclease editing system as described above, a split proteinreporter system comprising a type I CRISPR-Cas system, two fusionconstructs, an edit-discriminating or edit-targeting gRNA and a reportergene under the control of a T7 promoter. The core feature of CRISPR-Castypes and subtypes are different cas proteins, which are highlygenetically and functionally diverse. There are three major types ofCRISPR-Cas systems, which are distinguished from one another by uniquesignature genes: Cas3 in type I systems, Cas9 in type II systems, andCas10 in type III systems. Type I CRISPR systems utilize a two-componentstructure to degrade target DNA, which is exploited in the presentmethods and compositions to provide a handle for identifying editedcells in vivo. In type I systems, the signature gene, Cas3, encodes alarge protein with helicase activity.

The effector complexes of type I CRISPR-Cas systems display elaboratearchitectures, made up of Cas5c, Cas7c and Cas8c protein subunits inType I-C systems, and CasA/Cse1, Cse2, Cas7e, Cas5e, and Cas6e in TypeI-E systems. The Cas5c subunit binds the 5′-handle of the crRNA andinteracts with the large Cas8c subunit. The Type I-C Cascade complex(made up of Cas5c, Cas7c, Cas8C and crRNA) binds to a target DNAsequence and forms an R-loop complex with the RNA guide and target DNAstrand. After R-loop formation, the Cascade complex recruits Cas3 whichnicks the non-target strand and begins processive DNA degradation.Either Type I-C or Type I-E systems may be utilized, although Type-I-Csystems are preferred due to their smaller size, e.g., resulting in areduced payload delivery to the cells of choice.

The present methods and compositions are drawn to the natural mechanismof Cas3 recruitment upon Cascade complex target recognition to initiatean intracellular signal amplification event specific to the highfidelity targeting of the Cascade complex to a specified DNA sequence.By attaching one-half of a split protein to a deactivated Cas3 (dCas3)and the other half of the split protein to a protein component of theCascade, a system is created in which the two split portions of theprotein only come together when the Cascade complex has formed adiscriminatory R-loop and complexed with the correct target DNAsequence.

Many different proteins may be used according to the present methods;however, one of particular interest is the T7 RNA polymerase (T7 RNAP)which has already been validated for use in protein complementationassays. (See, e.g., Shis, et al., PNAS, 110: and 5028-5033 (2013); andPu, et al., Nat Chem Biol 13:432-438 (2017).) In the case of split T7RNAP, in a preferred embodiment the C-terminus T7 RNAP fragment (e.g.,amino acids 181-883) is fused to the Cascade complex via the C-terminusof the Cas5c (in a Type-I-C system) or the C-terminus of the casA/cse1protein (in a Type-I-E system) and the N-terminus of the T7 RNAPfragment (e.g., amino acids 1-179) is fused to the N-terminus of thedeactivated dCas3 protein. The cascade proteins comprising theC-terminus of the T7 RNAP complexes with an edit-discriminating crRNAforming a Cascade complex. Upon recognition of the Cascade complex andthe target sequence (e.g., where the target sequence comprises anintended edit and an R-loop is formed) and upon recruitment of dCas3fused to the N-terminus of T7 RNAP, the two halves of the split T7 RNAPare brought into proximity resulting in an active T7 polymerase. Theactive T7 polymerase is capable of transcribing linear or circular dsDNAfragments introduced into the cell.

In the present methods, the linear dsDNA to be transcribed by the T7polymerase comprises a coding sequence for a reporter gene such as, inan exemplary embodiment, a luciferase gene (or a fluorescent proteinsuch as green fluorescent protein coding sequence, an antibioticresistance gene, a gene coding for a cell surface marker, etc.) underthe control of a T7 promoter. The reconstituted T7 polymerase binds theT7 promoter and transcribes the reporter gene coding sequence. In thecase where the reporter gene codes for luciferase, the transcribedsequence is translated into the luciferase enzyme. If the substrate ofluciferase, luciferin, is present, luciferase will catalyze a two-stepoxidation process to yield light, which results in enough fluorescenceto sort cells in a FACS, which in turn allows for isolation of apopulation of cells with intended edits.

It should be noted that one of ordinary skill in the art given thepresent disclosure, that although the discussion herein focuses onfusing the C-terminal portion of T7 RNAP to the C-terminus of thecascade protein complex (e.g., the C-terminus of cas5c in type I-Csystems or the C-terminus of cse1 or casA in type I-E systems) and theN-terminal portion of the T7 RNAP to the N-terminus of dCas3,alternative embodiments envision fusing the C-terminal portion of T7RNAP to the C-terminus of dCas3 and the N-terminal portion of the T7RNAP to the N-terminus of the cascade protein complex (e.g., theN-terminus of cas5c in type I-C systems or the N-terminus of cse1 orcasA in type I-E systems). Additionally, other combinations of a splitprotein reporter system utilizing a type I CRISPR-Cas system may beenvisioned as long as the combinations involve the formation of acascade complex upon recognition of an intended edit in a target DNAsequence, formation of an R-loop, recruitment of dCas3 andreconstitution of activity of the split protein.

FIG. 1A is a simple process diagram for in vivo detection of cells thathave been properly edited via nucleic acid-guided nuclease editing. In afirst step of method 100, Cascade-T7-RNAP-C-terminal fusion anddCas3-T7-RNAP-N-terminal fusion constructs are synthesized 102. Theappropriate “split” between the N-terminal and C-terminal portions ofthe T7 RNAP to produce a reconstituted polymerase once the two portionsare in proximity with one another may be determined empirically. Thepolymerase is “split” at a point where there is no spontaneousassociation between the N-terminal and C-terminal portions of the T7RNAP in the absence of physical proximity due to the association ofdCas3 with the Cascade complex; however, the “split” must allowassociation of the N-terminal and C-terminal portions of thepolymerase—and reconstitution of polymerase activity—in the presence ofthe association of dCas3 with the Cascade complex. Once the proper“split” for the polymerase is determined, appropriate Cascade-T7-RNAPfusion and dCas3-T7-RNAP fusion constructs can be designed andsynthesized. In one embodiment, the N-terminal portion of the T7 RNAPcomprises approximately amino acids 1-179 of the T7 RNAP protein and theC-terminal portion of the T7 RNAP comprises approximately amino acids181-883 of the T7 RNAP protein. In one exemplary embodiment, theC-terminal portion of T7 RNAP is fused to the C-terminus of the cascadeprotein complex (e.g., the C-terminus of cas5c in type I-C systems orthe C-terminus of cse1 or casA in type I-E systems) and the N-terminalportion of the T7 RNAP if fused to the N-terminus of dCas3.

Once synthesized, the appropriate Cascade-T7-RNAP-C-terminal fusion anddCas3-T7-RNAP-N-terminal fusion constructs are inserted into a vector104 to be transformed 110 into cells of choice. In this exemplaryembodiment, the cells of choice comprise the coding sequence of areporter gene under the control of a T7 promoter and comprise a sequencefor the edit-discriminating or edit-targeting gRNA. However, inalternative embodiments, the coding sequence of the reporter gene underthe control of the T7 promoter and/or the edit-discriminating gRNA maybe located on a reporter vector with the fusion constructs (as describedbelow) or on the engine or editing vector (where these vectors aredescribed briefly below). Also in this exemplary embodiment, theCascade-T7-RNAP-C-terminal fusion and dCas3-T7-RNAP-N-terminal fusionconstructs are located on the engine vector with the nucleic acid-guidednuclease coding sequence. In alternative embodiments, theCascade-T7-RNAP-C-terminal fusion and dCas3-T7-RNAP-N-terminal fusionconstructs may be contained on a single reporter vector separate fromthe engine vector (along with, e.g., the coding sequence for thereporter gene under the control of the T7 promoter and/or theedit-discriminating gRNA) where both fusion constructs are under thecontrol of the same promoter or under the control of differentpromoters. In yet another embodiment, the Cascade-T7-RNAP-C-terminalfusion and dCas3-T7-RNAP-N-terminal fusion constructs may be containedon separate reporter vectors (e.g., see FIGS. 8A-8D). In yet anotheralternative, one or both of the Cascade-T7-RNAP-C-terminal fusion anddCas3-T7-RNAP-N-terminal fusion constructs may be stably integrated intothe cellular genome.

The reporter gene as envisioned herein is a gene used as an indicator ofproper genome editing and is under the control of a T7 promoter. The T7promoter is the promoter for the bacteriophage T7 RNA polymerase, whichis 19 base pairs in length. Reporter genes are used widely in molecularbiology for gene expression and to study cellular events. Typically, areporter gene is cloned into an expression vector that is thentransformed or transfected into cells. The cells then may be assayed forthe presence of the reporter by directly measuring the reporter proteinitself such as in systems with reporters such as fluorescent proteinssuch as GFP, RFP and BFP, cell surface markers or antibiotic resistance,or by measuring the enzymatic activity of the reporter protein on asubstrate (e.g., luciferase) or measuring fluorescence of a compound inthe presence of a fluorogen such as systems with, e.g., RNA aptamers.

In the present methods and compositions, the reporter gene is under thecontrol of the T7 promoter and the reporter gene is transcribed onlyupon reconstitution of the activity of T7 RNAP via proximity of theCascade-T7-RNAP-C-terminal fusion construct and dCas3-T7-RNAP-N-terminalfusion construct when the Cascade complex recognizes and binds to theedited target locus. Although fluorescent proteins such as greenfluorescent protein and blue fluorescent proteins may be employed, insome embodiments the compositions employ a luciferase reporter assay,which provides increased sensitivity, dynamic range and versatility overfluorescent proteins. Alternatively, RNA aptamers such as spinach andbroccoli can also be expressed as a reporter gene under control of a T7promoter and exhibit fluorescence in the presence of DFHBI((5Z)-5-[(3,5-Difluoro-4-hydroxyphenyl)methylene]-3,5-dihydro-2-methyl-3-(2,2,2-trifluoroethyl)-4H-imidazol-4-one).Bioluminescent reporter assays have the advantage over fluorescentassays in that they deliver 10-1,000-fold higher assay sensitivity. Theluciferase reporter technology as may be employed herein is based on theinteraction of the enzyme luciferase (the coding sequence for which isthe “reporter gene”) and the luminescent substrate luciferin, whichreleases light by the process of bioluminescence. Two commonly-employedluciferases are firefly luciferase, a 61 kDa enzyme which requires nopost-translational modifications, and Renilla luciferase, a 36 kDaenzyme which also requires no post-translational modifications. Bycoupling the T7 promoter to the coding sequence of the luciferase gene,the binding of the Cascade complex with the C-terminal T7 RNAP fusionand the dCas3-N-terminal T7 RNAP to the edited target locus toreconstitute T7 RNAP activity can be detected. Further, as mentionedabove, the reporter gene may comprise an antibiotic resistance gene suchthat edited cells may be identified by antibiotic resistance, or thereporter gene may comprise a cell surface marker protein such that thecells may be sorted via antibodies to the cell surface marker protein.

A fourth component of the split protein reporter system is anedit-discriminating gRNA. The edit-discriminating gRNA is engineeredspecifically to bind to a target sequence that has been properly edited,and not to unedited (e.g., wild type) or incorrectly-edited sequences.In this instance, the edit-discriminating gRNA is not part of an editingcassette comprising a donor DNA (or homology arm). Instead, theedit-discriminating gRNA is used to recruit dCas3 to the Cascadecomplex; that is, the edit-discriminating gRNA is used for sequencerecognition and not to facilitate an edit.

Thus, the nucleic acid-guided nuclease editing components and the splitprotein reporter system components must be transformed or transfectedinto the cell of interest. Transformation is intended to include to avariety of art-recognized techniques for introducing an exogenousnucleic acid sequence (e.g., engine and/or editing vectors) into atarget cell, and the term “transformation” as used herein includes alltransformation and transfection techniques. Such methods include, butare not limited to, electroporation, lipofection, optoporation,injection, microprecipitation, microinjection, liposomes, particlebombardment, sonoporation, laser-induced poration, bead transfection,calcium phosphate or calcium chloride co-precipitation, orDEAE-dextran-mediated transfection. Cells can also be prepared forvector uptake using, e.g., a sucrose, sorbitol or glycerol wash.Additionally, hybrid techniques that exploit the capabilities ofmechanical and chemical transfection methods can be used, e.g.,magnetofection, a transfection methodology that combines chemicaltransfection with mechanical methods. In another example, cationiclipids may be deployed in combination with gene guns or eiectropoiators.Suitable materials and methods for transforming or transfecting targetcells can be found, e.g., in Green and Sambrook, Molecular Cloning: ALaboratory Manual, 4th, ed., Cold Spring Harbor Laboratory Press, ColdSpring Harbor, N.Y., 2014). The present automated methods using theautomated multi-module cell processing instrument utilize flow-throughelectroporation such as the exemplary device shown in FIGS. 5B-5F.

Simultaneously or next, an editing cassette library is designed 106.Methods and compositions for designing and synthesizing editingcassettes are described in USPNs 10,240,167; 10,266,849; 9,982,278;10,351,877; 10,364,442; and 10,435,715; and U.S. Ser. No. 16/275,465,filed 14 Feb. 2019. U.S. Ser. No. 16/275,465, filed 14 Feb. 2019describes compound editing cassettes that are used in some embodimentsof the compositions and methods described herein. Compound editingcassettes are editing cassettes comprising more than one gRNA and morethan one donor DNA. Once designed and synthesized 106, the library ofediting cassettes is amplified, purified and inserted 108 into anediting vector to produce a library of editing vectors. The library ofediting vectors is then transformed into the cells that have alreadybeen transformed with the Cascade-T7-RNAP-C-terminal fusion anddCas3-T7-RNAP-N-terminal fusion constructs 110. In alternativeembodiments, the editing vector, engine vector and reporter vector(s)may be transformed into the cells simultaneously. In yet otherembodiments, one or more of the components for the split proteinreporter system may be integrated into the cellular genome.

Once transformed, the cells are allowed to recover and selectionoptionally is performed to select for cells transformed with thereporter vector(s), engine vector and/or editing vector, all of whichmost often comprise a selectable marker. As described above, drugselectable markers such as ampicillin/carbenicillin, kanamycin,chloramphenicol, nourseothricin N-acetyl transferase, erythromycin,tetracycline, gentamicin, bleomycin, streptomycin, puromycin,hygromycin, blasticidin, and G418 or other selectable markers may beemployed. At a next step, conditions are provided such that editingtakes place 112. For example, if any of the editing components, such as,e.g., one or both of the nuclease or gRNA/donor DNA cassette, are underthe control of an inducible promoter, conditions are provided thatactivate the inducible promoter(s). Once the cells have been edited 112,the cells are selected (e.g., sorted) 114, this time via, e.g.,luminescence. Once the cells have been sorted such that the cells areenriched for edited cells, the edited cells may be used in research ormay be grown to a desired OD to be made electrocompetent again, followedby another round of editing.

FIG. 1B is a simplified schematic of the components of a split proteinsystem where a gRNA does not bind to the wild type (e.g., unedited)genomic target sequence. At top of FIG. 1B is a reporter gene such asluciferase under the control of a T7 promoter. Also seen are anedit-discriminating or edit-targeting gRNA, a Cascade-T7-RNAP-C-terminalfusion construct and a dCas3-C-T7-RNAP-N-terminal fusion construct. Inthis instance the target genomic sequence is a “wild type” or uneditedsequence, which is not recognized by the edit-discriminating gRNA.Because the edit-discriminating gRNA does not recognize the wild typegenomic sequence, the Cascade-T7-RNAP-C-terminal fusion construct andedit-discriminating gRNA fail to form an R-loop complex at the genomiclocus of interest. Without the formation of the R-loop at the genomiclocus of interest, the dCas3-T7-RNAP-N-terminal fusion construct is notrecruited to the locus of interest and the N-terminal and C-terminalportions of T7 RNAP are not brought into proximity. Without theN-terminal and C-terminal portions of T7 RNAP being brought intoproximity, the activity of the T7 RNAP is not reconstituted, the T7promoter is not activated, and the reporter gene is not transcribed (ortranslated). That is, the reporter is silent because the Cascade complexand dCas3 fail to bind the genomic target sequence such that the T7 RNAPmay be “reconstituted” and activated.

FIG. 1C is a simplified schematic of the components of a split proteinsystem where there is an edited genomic target sequence, but the editedgenomic target sequence is not a correct or desired edit. In thisinstance, the edit-discriminating gRNA does not bind to theincorrectly-edited genomic target sequence. At top of FIG. 1C, like FIG.1B, is a reporter gene such as luciferase under the control of a T7promoter. Also seen are a gRNA, a Cascade-T7-RNAP-C-terminal fusionconstruct and a dCas3-T7-RNAP-N-terminal fusion construct. Here, thetarget genomic sequence comprises an “indel” or incorrectly-editedsequence resulting from, e.g., an edit caused by non-homologous endjoining (NHEJ), which repairs the double-strand break in the targetgenome without homologous repair rather than by homologous recombination(HR), which results in a precise, desired edit. The indel (e.g.,incorrect edit), like the wild type genomic target sequence, is notrecognized by the edit-discriminating gRNA. Because theedit-discriminating gRNA does not recognize the incorrectly-editedgenomic sequence, the Cascade-T7-RNAP-C-terminal fusion construct andgRNA fail to form an R-loop complex at the genomic locus. Without theformation of the R-loop complex at the locus of interest, thedCas3-T7-RNAP-N-terminal fusion construct is not recruited to the locusof interest and the N-terminal and C-terminal portions of T7 RNAP arenot brought into proximity. Without the N-terminal and C-terminalportions of T7 RNAP being brought into proximity, the activity of the T7RNAP is not reconstituted, the T7 promoter is not activated, and thereporter gene is not transcribed (or translated). That is, like theprocess depicted in FIG. 1B, the reporter is silent because the Cascadecomplex and dCas3 fail to bind the genomic target sequence such that theT7 RNAP may be “reconstituted” and activated.

FIG. 1D is a simplified schematic of the components of a split proteinsystem where there is a properly-edited genomic target sequence and thereporter gene is activated. At top of FIG. 1D is a reporter gene such asluciferase under the control of a T7 promoter. Also seen are anedit-discriminating gRNA, a Cascade-T7-RNAP-C-terminal fusion constructand a dCas3-T7-RNAP-N-terminal fusion construct bound to a genomictarget sequence. In this instance the target genomic sequence is aproperly-edited sequence, which is recognized by the edit-discriminatinggRNA. Now the edit-discriminating gRNA and Cascade-T7-RNAP-C-terminalfusion construct can form an R-loop complex at the target genomic locusand the dCas3-T7-RNAP-N-terminal fusion construct may now be recruitedto the target genomic locus thereby bringing the N-terminal andC-terminal portions of T7 RNAP into functional proximity. With theN-terminal and C-terminal portions of T7 RNAP being brought intofunctional proximity, the activity of the T7 RNAP is reconstituted andthe T7 promoter is activated, thereby transcribing the reporter genewhich in this embodiment is luciferase. Also seen in this FIG. 1D areschematics illustrating the transcribed (and translated) luciferaseenzyme catalyzing the conversion of D-luciferin (the substrate) intodetectable luminescence.

Automated Cell Editing Instruments and Modules to Perform NucleicAcid-Guided Nuclease Editing in Cells

Automated Cell Editing Instruments

FIG. 2A depicts an exemplary automated multi-module cell processinginstrument 200 to, e.g., perform one of the exemplary workflowscomprising a split protein reporter system as described herein. Theinstrument 200, for example, may be and preferably is designed as astand-alone desktop instrument for use within a laboratory environment.The instrument 200 may incorporate a mixture of reusable and disposablecomponents for performing the various integrated processes in conductingautomated genome cleavage and/or editing in cells without humanintervention. Illustrated is a gantry 202, providing an automatedmechanical motion system (actuator) (not shown) that supplies XYZ axismotion control to, e.g., an automated (i.e., robotic) liquid handlingsystem 258 including, e.g., an air displacement pipettor 232 whichallows for cell processing among multiple modules without humanintervention. In some automated multi-module cell processinginstruments, the air displacement pipettor 232 is moved by gantry 202and the various modules and reagent cartridges remain stationary;however, in other embodiments, the liquid handling system 258 may staystationary while the various modules and reagent cartridges are moved.Also included in the automated multi-module cell processing instrument200 are reagent cartridges 210 comprising reservoirs 212 andtransformation module 230 (e.g., a flow-through electroporation deviceas described in detail in relation to FIGS. 5B-5F), as well as washreservoirs 206, cell input reservoir 251 and cell output reservoir 253.The wash reservoirs 206 may be configured to accommodate large tubes,for example, wash solutions, or solutions that are used often throughoutan iterative process. Although two of the reagent cartridges 210comprise a wash reservoir 206 in FIG. 2A, the wash reservoirs insteadcould be included in a wash cartridge where the reagent and washcartridges are separate cartridges. In such a case, the reagentcartridge 210 and wash cartridge 204 may be identical except for theconsumables (reagents or other components contained within the variousinserts) inserted therein.

In some implementations, the reagent cartridges 210 are disposable kitscomprising reagents and cells for use in the automated multi-module cellprocessing/editing instrument 200. For example, a user may open andposition each of the reagent cartridges 210 comprising various desiredinserts and reagents within the chassis of the automated multi-modulecell editing instrument 200 prior to activating cell processing.Further, each of the reagent cartridges 210 may be inserted intoreceptacles in the chassis having different temperature zonesappropriate for the reagents contained therein.

Also illustrated in FIG. 2A is the robotic liquid handling system 258including the gantry 202 and air displacement pipettor 232. In someexamples, the robotic handling system 258 may include an automatedliquid handling system such as those manufactured by Tecan Group Ltd. ofMannedorf, Switzerland, Hamilton Company of Reno, Nev. (see, e.g.,WO2018015544A1), or Beckman Coulter, Inc. of Fort Collins, Colo. (see,e.g., US20160018427A1). Pipette tips may be provided in a pipettetransfer tip supply (not shown) for use with the air displacementpipettor 232.

Inserts or components of the reagent cartridges 210, in someimplementations, are marked with machine-readable indicia (not shown),such as bar codes, for recognition by the robotic handling system 258.For example, the robotic liquid handling system 258 may scan one or moreinserts within each of the reagent cartridges 210 to confirm contents.In other implementations, machine-readable indicia may be marked uponeach reagent cartridge 210, and a processing system (not shown, but seeelement 237 of FIG. 2B) of the automated multi-module cell editinginstrument 200 may identify a stored materials map based upon themachine-readable indicia. In the embodiment illustrated in FIG. 2A, acell growth module comprises a cell growth vial 218 (described ingreater detail below in relation to FIGS. 3A-3D). Additionally seen isthe TFF module 222 (described above in detail in relation to FIGS.4A-4E). Also illustrated as part of the automated multi-module cellprocessing instrument 200 of FIG. 2A is a singulation module 240 (e.g.,a solid wall isolation, incubation and normalization device (SWIINdevice) is shown here) described herein in relation to FIGS. 6C-6F,served by, e.g., robotic liquid handing system 258 and air displacementpipettor 232. Additionally seen is a selection module 220. Also note theplacement of three heatsinks 255.

FIG. 2B is a simplified representation of the contents of the exemplarymulti-module cell processing instrument 200 depicted in FIG. 2A.Cartridge-based source materials (such as in reagent cartridges 210),for example, may be positioned in designated areas on a deck of theinstrument 200 for access by an air displacement pipettor 232. The deckof the multi-module cell processing instrument 200 may include aprotection sink such that contaminants spilling, dripping, oroverflowing from any of the modules of the instrument 200 are containedwithin a lip of the protection sink. Also seen are reagent cartridges210, which are shown disposed with thermal assemblies 211 which cancreate temperature zones appropriate for different regions. Note thatone of the reagent cartridges also comprises a flow-throughelectroporation device 230 (FTEP), served by FTEP interface (e.g.,manifold arm) and actuator 231. Also seen is TFF module 222 withadjacent thermal assembly 225, where the TFF module is served by TFFinterface (e.g., manifold arm) and actuator 233. Thermal assemblies225,235, and 245 encompass thermal electric devices such as Peltierdevices, as well as heatsinks, fans and coolers. The rotating growthvial 218 is within a growth module 234, where the growth module isserved by two thermal assemblies 235. Selection module is seen at 220.Also seen is the SWIIN module 240, comprising a SWIIN cartridge 241,where the SWIIN module also comprises a thermal assembly 245,illumination 243 (in this embodiment, backlighting), evaporation andcondensation control 249, and where the SWIIN module is served by SWIINinterface (e.g., manifold arm) and actuator 247. Also seen in this viewis touch screen display 201, display actuator 203, illumination 205 (oneon either side of multi-module cell processing instrument 200), andcameras 239 (one illumination device on either side of multi-module cellprocessing instrument 200). Finally, element 237 comprises electronics,such as circuit control boards, high-voltage amplifiers, power supplies,and power entry; as well as pneumatics, such as pumps, valves andsensors.

FIG. 2C illustrates a front perspective view of multi-module cellprocessing instrument 200 for use in as a desktop version of theautomated multi-module cell editing instrument 200. For example, achassis 290 may have a width of about 24-48 inches, a height of about24-48 inches and a depth of about 24-48 inches. Chassis 290 may be andpreferably is designed to hold all modules and disposable supplies usedin automated cell processing and to perform all processes requiredwithout human intervention; that is, chassis 290 is configured toprovide an integrated, stand-alone automated multi-module cellprocessing instrument. As illustrated in FIG. 2C, chassis 290 includestouch screen display 201, cooling grate 264, which allows for air flowvia an internal fan (not shown). The touch screen display providesinformation to a user regarding the processing status of the automatedmulti-module cell editing instrument 200 and accepts inputs from theuser for conducting the cell processing. In this embodiment, the chassis290 is lifted by adjustable feet 270 a, 270 b, 270 c and 270 d (feet 270a-270 c are shown in this FIG. 2C). Adjustable feet 270 a-270 d, forexample, allow for additional air flow beneath the chassis 290.

Inside the chassis 290, in some implementations, will be most or all ofthe components described in relation to FIGS. 2A and 2B, including therobotic liquid handling system disposed along a gantry, reagentcartridges 210 including a flow-through electroporation device, arotating growth vial 218 in a cell growth module 234, a tangential flowfiltration module 222, a SWIIN module 240 as well as interfaces andactuators for the various modules. In addition, chassis 290 housescontrol circuitry, liquid handling tubes, air pump controls, valves,sensors, thermal assemblies (e.g., heating and cooling units) and othercontrol mechanisms. For examples of multi-module cell editinginstruments, see U.S. Pat. No. 10,253,316, issued 9 Apr. 2019; U.S. Pat.No. 10,329,559, issued 25 Jun. 2019; U.S. Pat. No. 10,323,242, issued 18Jun. 2019; U.S. Pat. No. 10,421,959, issued 24 Sep. 2019; U.S. Pat. No.10,465,185, issued 5 Nov. 2019; and U.S. Ser. No. 16/412,195, filed 14May 2019; Ser. No. 16/571,091, filed 14 Sep. 2019; and Ser. No.16/666,964, filed 29 Oct. 2019, all of which are herein incorporated byreference in their entirety.

The Rotating Cell Growth Module

FIG. 3A shows one embodiment of a rotating growth vial 300 for use withthe cell growth device and in the automated multi-module cell processinginstruments described herein. The rotating growth vial 300 is anoptically-transparent container having an open end 304 for receivingliquid media and cells, a central vial region 306 that defines theprimary container for growing cells, a tapered-to-constricted region 318defining at least one light path 310, a closed end 316, and a driveengagement mechanism 312. The rotating growth vial 300 has a centrallongitudinal axis 320 around which the vial rotates, and the light path310 is generally perpendicular to the longitudinal axis of the vial. Thefirst light path 310 is positioned in the lower constricted portion ofthe tapered-to-constricted region 318. Optionally, some embodiments ofthe rotating growth vial 300 have a second light path 308 in the taperedregion of the tapered-to-constricted region 318. Both light paths inthis embodiment are positioned in a region of the rotating growth vialthat is constantly filled with the cell culture (cells+growth media) andare not affected by the rotational speed of the growth vial. The firstlight path 310 is shorter than the second light path 308 allowing forsensitive measurement of OD values when the OD values of the cellculture in the vial are at a high level (e.g., later in the cell growthprocess), whereas the second light path 308 allows for sensitivemeasurement of OD values when the OD values of the cell culture in thevial are at a lower level (e.g., earlier in the cell growth process).

The drive engagement mechanism 312 engages with a motor (not shown) torotate the vial. In some embodiments, the motor drives the driveengagement mechanism 312 such that the rotating growth vial 300 isrotated in one direction only, and in other embodiments, the rotatinggrowth vial 300 is rotated in a first direction for a first amount oftime or periodicity, rotated in a second direction (i.e., the oppositedirection) for a second amount of time or periodicity, and this processmay be repeated so that the rotating growth vial 300 (and the cellculture contents) are subjected to an oscillating motion. Further, thechoice of whether the culture is subjected to oscillation and theperiodicity therefor may be selected by the user. The first amount oftime and the second amount of time may be the same or may be different.The amount of time may be 1, 2, 3, 4, 5, or more seconds, or may be 1,2, 3, 4 or more minutes. In another embodiment, in an early stage ofcell growth the rotating growth vial 400 may be oscillated at a firstperiodicity (e.g., every 60 seconds), and then a later stage of cellgrowth the rotating growth vial 300 may be oscillated at a secondperiodicity (e.g., every one second) different from the firstperiodicity.

The rotating growth vial 300 may be reusable or, preferably, therotating growth vial is consumable. In some embodiments, the rotatinggrowth vial is consumable and is presented to the user pre-filled withgrowth medium, where the vial is hermetically sealed at the open end 304with a foil seal. A medium-filled rotating growth vial packaged in sucha manner may be part of a kit for use with a stand-alone cell growthdevice or with a cell growth module that is part of an automatedmulti-module cell processing system. To introduce cells into the vial, auser need only pipette up a desired volume of cells and use the pipettetip to punch through the foil seal of the vial. Open end 304 mayoptionally include an extended lip 402 to overlap and engage with thecell growth device. In automated systems, the rotating growth vial 400may be tagged with a barcode or other identifying means that can be readby a scanner or camera (not shown) that is part of the automated system.

The volume of the rotating growth vial 300 and the volume of the cellculture (including growth medium) may vary greatly, but the volume ofthe rotating growth vial 300 must be large enough to generate aspecified total number of cells. In practice, the volume of the rotatinggrowth vial 400 may range from 1-250 mL, 2-100 mL, from 5-80 mL, 10-50mL, or from 12-35 mL. Likewise, the volume of the cell culture(cells+growth media) should be appropriate to allow proper aeration andmixing in the rotating growth vial 400. Proper aeration promotes uniformcellular respiration within the growth media. Thus, the volume of thecell culture should be approximately 5-85% of the volume of the growthvial or from 20-60% of the volume of the growth vial. For example, for a30 mL growth vial, the volume of the cell culture would be from about1.5 mL to about 26 mL, or from 6 mL to about 18 mL.

The rotating growth vial 300 preferably is fabricated from abio-compatible optically transparent material—or at least the portion ofthe vial comprising the light path(s) is transparent. Additionally,material from which the rotating growth vial is fabricated should beable to be cooled to about 4° C. or lower and heated to about 55° C. orhigher to accommodate both temperature-based cell assays and long-termstorage at low temperatures. Further, the material that is used tofabricate the vial must be able to withstand temperatures up to 55° C.without deformation while spinning. Suitable materials include cyclicolefin copolymer (COC), glass, polyvinyl chloride, polyethylene,polyamide, polypropylene, polycarbonate, poly(methyl methacrylate(PMMA), polysulfone, polyurethane, and co-polymers of these and otherpolymers. Preferred materials include polypropylene, polycarbonate, orpolystyrene. In some embodiments, the rotating growth vial isinexpensively fabricated by, e.g., injection molding or extrusion.

FIG. 3B is a perspective view of one embodiment of a cell growth device330. FIG. 3C depicts a cut-away view of the cell growth device 330 fromFIG. 3B. In both figures, the rotating growth vial 300 is seenpositioned inside a main housing 336 with the extended lip 302 of therotating growth vial 300 extending above the main housing 336.Additionally, end housings 352, a lower housing 332 and flanges 334 areindicated in both figures. Flanges 334 are used to attach the cellgrowth device 330 to heating/cooling means or other structure (notshown). FIG. 3C depicts additional detail. In FIG. 3C, upper bearing 342and lower bearing 340 are shown positioned within main housing 336.Upper bearing 342 and lower bearing 340 support the vertical load ofrotating growth vial 300. Lower housing 332 contains the drive motor338. The cell growth device 330 of FIG. 3C comprises two light paths: aprimary light path 344, and a secondary light path 350. Light path 344corresponds to light path 310 positioned in the constricted portion ofthe tapered-to-constricted portion of the rotating growth vial 300, andlight path 350 corresponds to light path 308 in the tapered portion ofthe tapered-to-constricted portion of the rotating growth via 316. Lightpaths 310 and 308 are not shown in FIG. 3C but may be seen in FIG. 3A.In addition to light paths 344 and 340, there is an emission board 348to illuminate the light path(s), and detector board 346 to detect thelight after the light travels through the cell culture liquid in therotating growth vial 300.

The motor 338 engages with drive mechanism 312 and is used to rotate therotating growth vial 300. In some embodiments, motor 338 is a brushlessDC type drive motor with built-in drive controls that can be set to holda constant revolution per minute (RPM) between 0 and about 3000 RPM.Alternatively, other motor types such as a stepper, servo, brushed DC,and the like can be used. Optionally, the motor 338 may also havedirection control to allow reversing of the rotational direction, and atachometer to sense and report actual RPM. The motor is controlled by aprocessor (not shown) according to, e.g., standard protocols programmedinto the processor and/or user input, and the motor may be configured tovary RPM to cause axial precession of the cell culture thereby enhancingmixing, e.g., to prevent cell aggregation, increase aeration, andoptimize cellular respiration.

Main housing 336, end housings 352 and lower housing 332 of the cellgrowth device 330 may be fabricated from any suitable, robust materialincluding aluminum, stainless steel, and other thermally conductivematerials, including plastics. These structures or portions thereof canbe created through various techniques, e.g., metal fabrication,injection molding, creation of structural layers that are fused, etc.Whereas the rotating growth vial 300 is envisioned in some embodimentsto be reusable, but preferably is consumable, the other components ofthe cell growth device 330 are preferably reusable and function as astand-alone benchtop device or as a module in a multi-module cellprocessing system.

The processor (not shown) of the cell growth device 330 may beprogrammed with information to be used as a “blank” or control for thegrowing cell culture. A “blank” or control is a vessel containing cellgrowth medium only, which yields 100% transmittance and 0 OD, while thecell sample will deflect light rays and will have a lower percenttransmittance and higher OD. As the cells grow in the media and becomedenser, transmittance will decrease and OD will increase. The processor(not shown) of the cell growth device 330—may be programmed to usewavelength values for blanks commensurate with the growth mediatypically used in cell culture (whether, e.g., mammalian cells,bacterial cells, animal cells, yeast cells, etc.). Alternatively, asecond spectrophotometer and vessel may be included in the cell growthdevice 330, where the second spectrophotometer is used to read a blankat designated intervals.

FIG. 3D illustrates a cell growth device 330 as part of an assemblycomprising the cell growth device 330 of FIG. 3B coupled to light source390, detector 392, and thermal components 394. The rotating growth vial300 is inserted into the cell growth device. Components of the lightsource 390 and detector 392 (e.g., such as a photodiode with gaincontrol to cover 5-log) are coupled to the main housing of the cellgrowth device. The lower housing 332 that houses the motor that rotatesthe rotating growth vial 300 is illustrated, as is one of the flanges334 that secures the cell growth device 330 to the assembly. Also, thethermal components 394 illustrated are a Peltier device orthermoelectric cooler. In this embodiment, thermal control isaccomplished by attachment and electrical integration of the cell growthdevice 330 to the thermal components 394 via the flange 334 on the baseof the lower housing 332. Thermoelectric coolers are capable of“pumping” heat to either side of a junction, either cooling a surface orheating a surface depending on the direction of current flow. In oneembodiment, a thermistor is used to measure the temperature of the mainhousing and then, through a standard electronicproportional-integral-derivative (PID) controller loop, the rotatinggrowth vial 300 is controlled to approximately +/−0.5° C.

In use, cells are inoculated (cells can be pipetted, e.g., from anautomated liquid handling system or by a user) into pre-filled growthmedia of a rotating growth vial 300 by piercing though the foil seal orfilm. The programmed software of the cell growth device 330 sets thecontrol temperature for growth, typically 30° C., then slowly starts therotation of the rotating growth vial 300. The cell/growth media mixtureslowly moves vertically up the wall due to centrifugal force allowingthe rotating growth vial 300 to expose a large surface area of themixture to a normal oxygen environment. The growth monitoring systemtakes either continuous readings of the OD or OD measurements at pre-setor pre-programmed time intervals. These measurements are stored ininternal memory and if requested the software plots the measurementsversus time to display a growth curve. If enhanced mixing is required,e.g., to optimize growth conditions, the speed of the vial rotation canbe varied to cause an axial precession of the liquid, and/or a completedirectional change can be performed at programmed intervals. The growthmonitoring can be programmed to automatically terminate the growth stageat a pre-determined OD, and then quickly cool the mixture to a lowertemperature to inhibit further growth.

One application for the cell growth device 330 is to constantly measurethe optical density of a growing cell culture. One advantage of thedescribed cell growth device is that optical density can be measuredcontinuously (kinetic monitoring) or at specific time intervals; e.g.,every 5, 10, 15, 20, 30 45, or 60 seconds, or every 1, 2, 3, 4, 5, 6, 7,8, 9, or 10 minutes. While the cell growth device 330 has been describedin the context of measuring the optical density (OD) of a growing cellculture, it should, however, be understood by a skilled artisan giventhe teachings of the present specification that other cell growthparameters can be measured in addition to or instead of cell culture OD.As with optional measure of cell growth in relation to the solid walldevice or module described supra, spectroscopy using visible, UV, ornear infrared (NIR) light allows monitoring the concentration ofnutrients and/or wastes in the cell culture and other spectroscopicmeasurements may be made; that is, other spectral properties can bemeasured via, e.g., dielectric impedance spectroscopy, visiblefluorescence, fluorescence polarization, or luminescence. Additionally,the cell growth device 430 may include additional sensors for measuring,e.g., dissolved oxygen, carbon dioxide, pH, conductivity, and the like.For additional details regarding rotating growth vials and cell growthdevices see U.S. Ser. No. 16/360,404, filed 21 Mar. 2019 and Ser. No.16/360,423, filed 21 Mar. 2019.

The Cell Concentration Module

As described above in relation to the rotating growth vial and cellgrowth module, in order to obtain an adequate number of cells fortransformation or transfection, cells typically are grown to a specificoptical density in medium appropriate for the growth of the cells ofinterest; however, for effective transformation or transfection, it isdesirable to decrease the volume of the cells as well as render thecells competent via buffer or medium exchange. Thus, one sub-componentor module that is desired in cell processing systems for the processeslisted above is a module or component that can grow, perform bufferexchange, and/or concentrate cells and render them competent so thatthey may be transformed or transfected with the nucleic acids needed forengineering or editing the cell's genome.

FIG. 4A shows a retentate member 422 (top), permeate member 420 (middle)and a tangential flow assembly 410 (bottom) comprising the retentatemember 422, membrane 424 (not seen in FIG. 4A), and permeate member 420(also not seen). In FIG. 4A, retentate member 422 comprises a tangentialflow channel 402, which has a serpentine configuration that initiates atone lower corner of retentate member 422—specifically at retentate port428—traverses across and up then down and across retentate member 422,ending in the other lower corner of retentate member 422 at a secondretentate port 428. Also seen on retentate member 422 are energydirectors 491, which circumscribe the region where a membrane or filter(not seen in this FIG. 4A) is seated, as well as interdigitate betweenareas of channel 402. Energy directors 491 in this embodiment mate withand serve to facilitate ultrasonic welding or bonding of retentatemember 422 with permeate/filtrate member 420 via the energy directorcomponent 491 on permeate/filtrate member 420 (at right). Additionally,countersinks 423 can be seen, two on the bottom one at the top middle ofretentate member 422. Countersinks 423 are used to couple and tangentialflow assembly 410 to a reservoir assembly (not seen in this FIG. 4A butsee FIG. 4B).

Permeate/filtrate member 420 is seen in the middle of FIG. 4A andcomprises, in addition to energy director 491, through-holes forretentate ports 428 at each bottom corner (which mate with thethrough-holes for retentate ports 428 at the bottom corners of retentatemember 422), as well as a tangential flow channel 402 and twopermeate/filtrate ports 426 positioned at the top and center of permeatemember 420. The tangential flow channel 402 structure in this embodimenthas a serpentine configuration and an undulating geometry, althoughother geometries may be used. Permeate member 420 also comprisescountersinks 423, coincident with the countersinks 423 on retentatemember 420.

On the left of FIG. 4A is a tangential flow assembly 410 comprising theretentate member 422 and permeate member 420 seen in this FIG. 4A. Inthis view, retentate member 422 is “on top” of the view, a membrane (notseen in this view of the assembly) would be adjacent and under retentatemember 422 and permeate member 420 (also not seen in this view of theassembly) is adjacent to and beneath the membrane. Again countersinks423 are seen, where the countersinks in the retentate member 422 and thepermeate member 420 are coincident and configured to mate with threadsor mating elements for the countersinks disposed on a reservoir assembly(not seen in FIG. 4A but see FIG. 4B).

A membrane or filter is disposed between the retentate and permeatemembers, where fluids can flow through the membrane but cells cannot andare thus retained in the flow channel disposed in the retentate member.Filters or membranes appropriate for use in the TFF device/module arethose that are solvent resistant, are contamination free duringfiltration, and are able to retain the types and sizes of cells ofinterest. For example, in order to retain small cell types such asbacterial cells, pore sizes can be as low as 0.2 μm, however for othercell types, the pore sizes can be as high as 20 μm. Indeed, the poresizes useful in the TFF device/module include filters with sizes from0.20 μm, 0.21 μm, 0.22 μm, 0.23 μm, 0.24 μm, 0.25 μm, 0.26 μm, 0.27 μm,0.28 μm, 0.29 μm, 0.30 μm, 0.31 μm, 0.32 μm, 0.33 μm, 0.34 μm, 0.35 μm,0.36 μm, 0.37 μm, 0.38 μm, 0.39 μm, 0.40 μm, 0.41 μm, 0.42 μm, 0.43 μm,0.44 μm, 0.45 μm, 0.46 μm, 0.47 μm, 0.48 μm, 0.49 μm, 0.50 μm andlarger. The filters may be fabricated from any suitable non-reactivematerial including cellulose mixed ester (cellulose nitrate and acetate)(CME), polycarbonate (PC), polyvinylidene fluoride (PVDF),polyethersulfone (PES), polytetrafluoroethylene (PTFE), nylon, glassfiber, or metal substrates as in the case of laser or electrochemicaletching.

The length of the channel structure 402 may vary depending on the volumeof the cell culture to be grown and the optical density of the cellculture to be concentrated. The length of the channel structuretypically is from 60 mm to 300 mm, or from 70 mm to 200 mm, or from 80mm to 100 mm. The cross-section configuration of the flow channel 402may be round, elliptical, oval, square, rectangular, trapezoidal, orirregular. If square, rectangular, or another shape with generallystraight sides, the cross section may be from about 10 μm to 1000 μmwide, or from 200 μm to 800 μm wide, or from 300 μm to 700 μm wide, orfrom 400 μm to 600 μm wide; and from about 10 μm to 1000 μm high, orfrom 200 μm to 800 μm high, or from 300 μm to 700 μm high, or from 400μm to 600 μm high. If the cross section of the flow channel 102 isgenerally round, oval or elliptical, the radius of the channel may befrom about 50 μm to 1000 μm in hydraulic radius, or from 5 μm to 800 μmin hydraulic radius, or from 200 μm to 700 μm in hydraulic radius, orfrom 300 μm to 600 μm wide in hydraulic radius, or from about 200 to 500μm in hydraulic radius. Moreover, the volume of the channel in theretentate 422 and permeate 420 members may be different depending on thedepth of the channel in each member.

FIG. 4B shows front perspective (right) and rear perspective (left)views of a reservoir assembly 450 configured to be used with thetangential flow assembly 410 seen in FIG. 4A. Seen in the frontperspective view (e.g., “front” being the side of reservoir assembly 450that is coupled to the tangential flow assembly 410 seen in FIG. 4A) areretentate reservoirs 452 on either side of permeate reservoir 454. Alsoseen are permeate ports 426, retentate ports 428, and three threads ormating elements 425 for countersinks 423 (countersinks 423 not seen inthis FIG. 4B). Threads or mating elements 425 for countersinks 423 areconfigured to mate or couple the tangential flow assembly 410 (seen inFIG. 4A) to reservoir assembly 450. Alternatively or in addition,fasteners, sonic welding or heat stakes may be used to mate or couplethe tangential flow assembly 410 to reservoir assembly 450. In additionis seen gasket 445 covering the top of reservoir assembly 450. Gasket445 is described in detail in relation to FIG. 4E. At left in FIG. 4B isa rear perspective view of reservoir assembly 1250, where “rear” is theside of reservoir assembly 450 that is not coupled to the tangentialflow assembly. Seen are retentate reservoirs 452, permeate reservoir454, and gasket 445.

The TFF device may be fabricated from any robust material in whichchannels (and channel branches) may be milled including stainless steel,silicon, glass, aluminum, or plastics including cyclic-olefin copolymer(COC), cyclo-olefin polymer (COP), polystyrene, polyvinyl chloride,polyethylene, polyamide, polyethylene, polypropylene, acrylonitrilebutadiene, polycarbonate, polyetheretheketone (PEEK), poly(methylmethylacrylate) (PMMA), polysulfone, and polyurethane, and co-polymersof these and other polymers. If the TFF device/module is disposable,preferably it is made of plastic. In some embodiments, the material usedto fabricate the TFF device/module is thermally-conductive so that thecell culture may be heated or cooled to a desired temperature. Incertain embodiments, the TFF device is formed by precision mechanicalmachining, laser machining, electro discharge machining (for metaldevices); wet or dry etching (for silicon devices); dry or wet etching,powder or sandblasting, photostructuring (for glass devices); orthermoforming, injection molding, hot embossing, or laser machining (forplastic devices) using the materials mentioned above that are amenableto this mass production techniques.

FIG. 4C depicts a top-down view of the reservoir assemblies 450 shown inFIG. 4B. FIG. 4D depicts a cover 444 for reservoir assembly 450 shown inFIGS. 4B and 4E depicts a gasket 445 that in operation is disposed oncover 444 of reservoir assemblies 450 shown in FIG. 4B. FIG. 4C is atop-down view of reservoir assembly 450, showing the tops of the tworetentate reservoirs 452, one on either side of permeate reservoir 454.Also seen are grooves 432 that will mate with a pneumatic port (notshown), and fluid channels 434 that reside at the bottom of retentatereservoirs 452, which fluidically couple the retentate reservoirs 452with the retentate ports 428 (not shown), via the through-holes for theretentate ports in permeate member 420 and membrane 424 (also notshown). FIG. 4D depicts a cover 444 that is configured to be disposedupon the top of reservoir assembly 450. Cover 444 has round cut-outs atthe top of retentate reservoirs 452 and permeate/filtrate reservoir 454.Again at the bottom of retentate reservoirs 452 fluid channels 434 canbe seen, where fluid channels 434 fluidically couple retentatereservoirs 452 with the retentate ports 428 (not shown). Also shown arethree pneumatic ports 430 for each retentate reservoir 452 andpermeate/filtrate reservoir 454. FIG. 4E depicts a gasket 445 that isconfigures to be disposed upon the cover 444 of reservoir assembly 450.Seen are three fluid transfer ports 442 for each retentate reservoir 452and for permeate/filtrate reservoir 454. Again, three pneumatic ports430, for each retentate reservoir 452 and for permeate/filtratereservoir 454, are shown.

The overall work flow for cell growth comprises loading a cell cultureto be grown into a first retentate reservoir, optionally bubbling air oran appropriate gas through the cell culture, passing or flowing the cellculture through the first retentate port then tangentially through theTFF channel structure while collecting medium or buffer through one orboth of the permeate ports 406, collecting the cell culture through asecond retentate port 404 into a second retentate reservoir, optionallyadding additional or different medium to the cell culture and optionallybubbling air or gas through the cell culture, then repeating theprocess, all while measuring, e.g., the optical density of the cellculture in the retentate reservoirs continuously or at desiredintervals. Measurements of optical densities (OD) at programmed timeintervals are accomplished using a 600 nm Light Emitting Diode (LED)that has been columnated through an optic into the retentatereservoir(s) containing the growing cells. The light continues through acollection optic to the detection system which consists of a (digital)gain-controlled silicone photodiode. Generally, optical density is shownas the absolute value of the logarithm with base 10 of the powertransmission factors of an optical attenuator: OD=−log 10 (Powerout/Power in). Since OD is the measure of optical attenuation—that is,the sum of absorption, scattering, and reflection—the TFF device ODmeasurement records the overall power transmission, so as the cells growand become denser in population, the OD (the loss of signal) increases.The OD system is pre-calibrated against OD standards with these valuesstored in an on-board memory accessible by the measurement program.

In the channel structure, the membrane bifurcating the flow channelsretains the cells on one side of the membrane (the retentate side 422)and allows unwanted medium or buffer to flow across the membrane into afiltrate or permeate side (e.g., permeate member 420) of the device.Bubbling air or other appropriate gas through the cell culture bothaerates and mixes the culture to enhance cell growth. During theprocess, medium that is removed during the flow through the channelstructure is removed through the permeate/filtrate ports 406.Alternatively, cells can be grown in one reservoir with bubbling oragitation without passing the cells through the TFF channel from onereservoir to the other.

The overall work flow for cell concentration using the TFF device/moduleinvolves flowing a cell culture or cell sample tangentially through thechannel structure. As with the cell growth process, the membranebifurcating the flow channels retains the cells on one side of themembrane and allows unwanted medium or buffer to flow across themembrane into a permeate/filtrate side (e.g., permeate member 420) ofthe device. In this process, a fixed volume of cells in medium or bufferis driven through the device until the cell sample is collected into oneof the retentate ports 404, and the medium/buffer that has passedthrough the membrane is collected through one or both of thepermeate/filtrate ports 406. All types of prokaryotic and eukaryoticcells—both adherent and non-adherent cells—can be grown in the TFFdevice. Adherent cells may be grown on beads or other cell scaffoldssuspended in medium that flow through the TFF device.

The medium or buffer used to suspend the cells in the cell concentrationdevice/module may be any suitable medium or buffer for the type of cellsbeing transformed or transfected, such as LB, SOC, TPD, YPG, YPAD, MEM,DMEM, IMDM, RPMI, Hanks', PBS and Ringer's solution, where the media maybe provided in a reagent cartridge as part of a kit. For culture ofadherent cells, cells may be disposed on beads, microcarriers, or othertype of scaffold suspended in medium. Most normal mammaliantissue-derived cells—except those derived from the hematopoieticsystem—are anchorage dependent and need a surface or cell culturesupport for normal proliferation. In the rotating growth vial describedherein, microcarrier technology is leveraged. Microcarriers ofparticular use typically have a diameter of 100-300 μm and have adensity slightly greater than that of the culture medium (thusfacilitating an easy separation of cells and medium for, e.g., mediumexchange) yet the density must also be sufficiently low to allowcomplete suspension of the carriers at a minimum stirring rate in orderto avoid hydrodynamic damage to the cells. Many different types ofmicrocarriers are available, and different microcarriers are optimizedfor different types of cells. There are positively charged carriers,such as Cytodex 1 (dextran-based, GE Healthcare), DE-52(cellulose-based, Sigma-Aldrich Labware), DE-53 (cellulose-based,Sigma-Aldrich Labware), and HLX 11-170 (polystyrene-based); collagen- orECM- (extracellular matrix) coated carriers, such as Cytodex 3(dextran-based, GE Healthcare) or HyQ-sphere Pro-F 102-4(polystyrene-based, Thermo Scientific); non-charged carriers, likeHyQ-sphere P 102-4 (Thermo Scientific); or macroporous carriers based ongelatin (Cultisphere, Percell Biolytica) or cellulose (Cytopore, GEHealthcare).

In both the cell growth and concentration processes, passing the cellsample through the TFF device and collecting the cells in one of theretentate ports 404 while collecting the medium in one of thepermeate/filtrate ports 406 is considered “one pass” of the cell sample.The transfer between retentate reservoirs “flips” the culture. Theretentate and permeate ports collecting the cells and medium,respectively, for a given pass reside on the same end of TFFdevice/module with fluidic connections arranged so that there are twodistinct flow layers for the retentate and permeate/filtrate sides, butif the retentate port 404 resides on the retentate member ofdevice/module (that is, the cells are driven through the channel abovethe membrane and the filtrate (medium) passes to the portion of thechannel below the membrane), the permeate/filtrate port 406 will resideon the permeate member of device/module and vice versa (that is, if thecell sample is driven through the channel below the membrane, thefiltrate (medium) passes to the portion of the channel above themembrane). Due to the high pressures used to transfer the cell cultureand fluids through the flow channel of the TFF device, the effect ofgravity is negligible.

At the conclusion of a “pass” in either of the growth and concentrationprocesses, the cell sample is collected by passing through the retentateport 404 and into the retentate reservoir (not shown). To initiateanother “pass”, the cell sample is passed again through the TFF device,this time in a flow direction that is reversed from the first pass. Thecell sample is collected by passing through the retentate port 404 andinto retentate reservoir (not shown) on the opposite end of thedevice/module from the retentate port 404 that was used to collect cellsduring the first pass. Likewise, the medium/buffer that passes throughthe membrane on the second pass is collected through the permeate port406 on the opposite end of the device/module from the permeate port 406that was used to collect the filtrate during the first pass, or throughboth ports. This alternating process of passing the retentate (theconcentrated cell sample) through the device/module is repeated untilthe cells have been grown to a desired optical density, and/orconcentrated to a desired volume, and both permeate ports (i.e., ifthere are more than one) can be open during the passes to reduceoperating time. In addition, buffer exchange may be effected by adding adesired buffer (or fresh medium) to the cell sample in the retentatereservoir, before initiating another “pass”, and repeating this processuntil the old medium or buffer is diluted and filtered out and the cellsreside in fresh medium or buffer. Note that buffer exchange and cellgrowth may (and typically do) take place simultaneously, and bufferexchange and cell concentration may (and typically do) take placesimultaneously. For further information and alternative embodiments onTFFs see, e.g., U.S. Ser. No. 62/728,365, filed 7 Sep. 2018; 62/857,599,filed 5 Jun. 2019; and 62/867,415, filed 27 Jun. 2019.

The Cell Transformation Module

FIG. 5A depicts an exemplary combination reagent cartridge andelectroporation device 500 (“cartridge”) that may be used in anautomated multi-module cell processing instrument along with the TFFmodule. In addition, in certain embodiments the material used tofabricate the cartridge is thermally-conductive, as in certainembodiments the cartridge 500 contacts a thermal device (not shown),such as a Peltier device or thermoelectric cooler, that heats or coolsreagents in the reagent reservoirs or reservoirs 504. Reagent reservoirsor reservoirs 504 may be reservoirs into which individual tubes ofreagents are inserted as shown in FIG. 5A, or the reagent reservoirs mayhold the reagents without inserted tubes. Additionally, the reservoirsin a reagent cartridge may be configured for any combination of tubes,co-joined tubes, and direct-fill of reagents.

In one embodiment, the reagent reservoirs or reservoirs 504 of reagentcartridge 500 are configured to hold various size tubes, including,e.g., 250 ml tubes, 25 ml tubes, 10 ml tubes, 5 ml tubes, and Eppendorfor microcentrifuge tubes. In yet another embodiment, all reservoirs maybe configured to hold the same size tube, e.g., 5 ml tubes, andreservoir inserts may be used to accommodate smaller tubes in thereagent reservoir. In yet another embodiment—particularly in anembodiment where the reagent cartridge is disposable—the reagentreservoirs hold reagents without inserted tubes. In this disposableembodiment, the reagent cartridge may be part of a kit, where thereagent cartridge is pre-filled with reagents and the receptacles orreservoirs sealed with, e.g., foil, heat seal acrylic or the like andpresented to a consumer where the reagent cartridge can then be used inan automated multi-module cell processing instrument. As one of ordinaryskill in the art will appreciate given the present disclosure, thereagents contained in the reagent cartridge will vary depending on workflow; that is, the reagents will vary depending on the processes towhich the cells are subjected in the automated multi-module cellprocessing instrument, e.g., protein production, cell transformation andculture, cell editing, etc.

Reagents such as cell samples, enzymes, buffers, nucleic acid vectors,expression cassettes, proteins or peptides, reaction components (suchas, e.g., MgCl₂, dNTPs, nucleic acid assembly reagents, gap repairreagents, and the like), wash solutions, ethanol, and magnetic beads fornucleic acid purification and isolation, etc. may be positioned in thereagent cartridge at a known position. In some embodiments of cartridge500, the cartridge comprises a script (not shown) readable by aprocessor (not shown) for dispensing the reagents. Also, the cartridge500 as one component in an automated multi-module cell processinginstrument may comprise a script specifying two, three, four, five, tenor more processes to be performed by the automated multi-module cellprocessing instrument. In certain embodiments, the reagent cartridge isdisposable and is pre-packaged with reagents tailored to performingspecific cell processing protocols, e.g., genome editing or proteinproduction. Because the reagent cartridge contents vary whilecomponents/modules of the automated multi-module cell processinginstrument or system may not, the script associated with a particularreagent cartridge matches the reagents used and cell processesperformed. Thus, e.g., reagent cartridges may be pre-packaged withreagents for genome editing and a script that specifies the processsteps for performing genome editing in an automated multi-module cellprocessing instrument, or, e.g., reagents for protein expression and ascript that specifies the process steps for performing proteinexpression in an automated multi-module cell processing instrument.

For example, the reagent cartridge may comprise a script to pipettecompetent cells from a reservoir, transfer the cells to a transformationmodule, pipette a nucleic acid solution comprising a vector withexpression cassette from another reservoir in the reagent cartridge,transfer the nucleic acid solution to the transformation module,initiate the transformation process for a specified time, then move thetransformed cells to yet another reservoir in the reagent cassette or toanother module such as a cell growth module in the automatedmulti-module cell processing instrument. In another example, the reagentcartridge may comprise a script to transfer a nucleic acid solutioncomprising a vector from a reservoir in the reagent cassette, nucleicacid solution comprising editing oligonucleotide cassettes in areservoir in the reagent cassette, and a nucleic acid assembly mix fromanother reservoir to the nucleic acid assembly/desalting module, ifpresent. The script may also specify process steps performed by othermodules in the automated multi-module cell processing instrument. Forexample, the script may specify that the nucleic acid assembly/desaltingreservoir be heated to 50° C. for 30 min to generate an assembledproduct; and desalting and resuspension of the assembled product viamagnetic bead-based nucleic acid purification involving a series ofpipette transfers and mixing of magnetic beads, ethanol wash, andbuffer.

As described in relation to FIGS. 5B and 5C below, the exemplary reagentcartridges for use in the automated multi-module cell processinginstruments may include one or more electroporation devices, preferablyflow-through electroporation (FTEP) devices. In yet other embodiments,the reagent cartridge is separate from the transformation module.Electroporation is a widely-used method for permeabilization of cellmembranes that works by temporarily generating pores in the cellmembranes with electrical stimulation. Applications of electroporationinclude the delivery of DNA, RNA, siRNA, peptides, proteins, antibodies,drugs or other substances to a variety of cells such as mammalian cells(including human cells), plant cells, archea, yeasts, other eukaryoticcells, bacteria, and other cell types. Electrical stimulation may alsobe used for cell fusion in the production of hybridomas or other fusedcells. During a typical electroporation procedure, cells are suspendedin a buffer or medium that is favorable for cell survival. For bacterialcell electroporation, low conductance mediums, such as water, glycerolsolutions and the like, are often used to reduce the heat production bytransient high current. In traditional electroporation devices, thecells and material to be electroporated into the cells (collectively“the cell sample”) are placed in a cuvette embedded with two flatelectrodes for electrical discharge. For example, Bio-Rad (Hercules,Calif.) makes the GENE PULSER XCELL™ line of products to electroporatecells in cuvettes. Traditionally, electroporation requires high fieldstrength; however, the flow-through electroporation devices included inthe reagent cartridges achieve high efficiency cell electroporation withlow toxicity. The reagent cartridges of the disclosure allow forparticularly easy integration with robotic liquid handlinginstrumentation that is typically used in automated instruments andsystems such as air displacement pipettors. Such automatedinstrumentation includes, but is not limited to, off-the-shelf automatedliquid handling systems from Tecan (Mannedorf, Switzerland), Hamilton(Reno, Nev.), Beckman Coulter (Fort Collins, Colo.), etc.

FIGS. 5B and 5C are top perspective and bottom perspective views,respectively, of an exemplary FTEP device 550 that may be part of (e.g.,a component in) reagent cartridge 500 in FIG. 5A or may be a stand-alonemodule; that is, not a part of a reagent cartridge or other module. FIG.5B depicts an FTEP device 550. The FTEP device 550 has wells that definecell sample inlets 552 and cell sample outlets 554. FIG. 5C is a bottomperspective view of the FTEP device 550 of FIG. 5B. An inlet well 552and an outlet well 554 can be seen in this view. Also seen in FIG. 5Care the bottom of an inlet 562 corresponding to well 552, the bottom ofan outlet 564 corresponding to the outlet well 554, the bottom of adefined flow channel 566 and the bottom of two electrodes 568 on eitherside of flow channel 566. The FTEP devices may comprise push-pullpneumatic means to allow multi-pass electroporation procedures; that is,cells to electroporated may be “pulled” from the inlet toward the outletfor one pass of electroporation, then be “pushed” from the outlet end ofthe FTEP device toward the inlet end to pass between the electrodesagain for another pass of electroporation. Further, this process may berepeated one to many times. For additional information regarding FTEPdevices, see, e.g., U.S. Ser. No. 16/147,120, filed 28 Sep. 2018; Ser.No. 16/147,353, filed 28 Sep. 2018; Ser. No. 16/426,310, filed 30 May2019; and Ser. No. 16/147,871, filed 30 Sep. 2018; and U.S. Pat. No.10,323,258, issued 18 Jun. 2019. Further, other embodiments of thereagent cartridge may provide or accommodate electroporation devicesthat are not configured as FTEP devices, such as those described in U.S.Ser. No. 16/109,156, filed 22 Aug. 2018. For reagent cartridges usefulin the present automated multi-module cell processing instruments, see,e.g., U.S. Pat. No. 10,376,889, issued 13 Aug. 2019; and USSN16,451,601, filed 25 Jun. 2019.

Additional details of the FTEP devices are illustrated in FIGS. 5D-5F.Note that in the FTEP devices in FIGS. 5D-5F the electrodes are placedsuch that a first electrode is placed between an inlet and a narrowedregion of the flow channel, and the second electrode is placed betweenthe narrowed region of the flow channel and an outlet. FIG. 5D shows atop planar view of an FTEP device 550 having an inlet 552 forintroducing a fluid containing cells and exogenous material into FTEPdevice 550 and an outlet 554 for removing the transformed cells from theFTEP following electroporation. The electrodes 568 are introducedthrough channels (not shown) in the device. FIG. 5E shows a cutaway viewfrom the top of the FTEP device 550, with the inlet 552, outlet 554, andelectrodes 568 positioned with respect to a flow channel 566. FIG. 5Fshows a side cutaway view of FTEP device 550 with the inlet 552 andinlet channel 572, and outlet 554 and outlet channel 574. The electrodes568 are positioned in electrode channels 576 so that they are in fluidcommunication with the flow channel 566, but not directly in the path ofthe cells traveling through the flow channel 566. Note that the firstelectrode is placed between the inlet and the narrowed region of theflow channel, and the second electrode is placed between the narrowedregion of the flow channel and the outlet. The electrodes 568 in thisaspect of the device are positioned in the electrode channels 576 whichare generally perpendicular to the flow channel 566 such that the fluidcontaining the cells and exogenous material flows from the inlet channel572 through the flow channel 566 to the outlet channel 574, and in theprocess fluid flows into the electrode channels 376 to be in contactwith the electrodes 568. In this aspect, the inlet channel, outletchannel and electrode channels all originate from the same planar sideof the device. In certain aspects, however, the electrodes may beintroduced from a different planar side of the FTEP device than theinlet and outlet channels.

In the FTEP devices of the disclosure, the toxicity level of thetransformation results in greater than 30% viable cells afterelectroporation, preferably greater than 35%, 40%, 45%, 50%, 55%, 60%,70%, 75%, 80%, 85%, 90%, 95% or even 99% viable cells followingtransformation, depending on the cell type and the nucleic acids beingintroduced into the cells.

The housing of the FTEP device can be made from many materials dependingon whether the FTEP device is to be reused, autoclaved, or isdisposable, including stainless steel, silicon, glass, resin, polyvinylchloride, polyethylene, polyamide, polystyrene, polyethylene,polypropylene, acrylonitrile butadiene, polycarbonate,polyetheretheketone (PEEK), polysulfone and polyurethane, co-polymers ofthese and other polymers. Similarly, the walls of the channels in thedevice can be made of any suitable material including silicone, resin,glass, glass fiber, polyvinyl chloride, polyethylene, polyamide,polyethylene, polypropylene, acrylonitrile butadiene, polycarbonate,polyetheretheketone (PEEK), polysulfone and polyurethane, co-polymers ofthese and other polymers. Preferred materials include crystal styrene,cyclo-olefin polymer (COP) and cyclic olephin co-polymers (COC), whichallow the device to be formed entirely by injection molding in one piecewith the exception of the electrodes and, e.g., a bottom sealing film ifpresent.

The FTEP devices described herein (or portions of the FTEP devices) canbe created or fabricated via various techniques, e.g., as entire devicesor by creation of structural layers that are fused or otherwise coupled.For example, for metal FTEP devices, fabrication may include precisionmechanical machining or laser machining; for silicon FTEP devices,fabrication may include dry or wet etching; for glass FTEP devices,fabrication may include dry or wet etching, powderblasting,sandblasting, or photostructuring; and for plastic FTEP devicesfabrication may include thermoforming, injection molding, hot embossing,or laser machining. The components of the FTEP devices may bemanufactured separately and then assembled, or certain components of theFTEP devices (or even the entire FTEP device except for the electrodes)may be manufactured (e.g., using 3D printing) or molded (e.g., usinginjection molding) as a single entity, with other components added aftermolding. For example, housing and channels may be manufactured or moldedas a single entity, with the electrodes later added to form the FTEPunit. Alternatively, the FTEP device may also be formed in two or moreparallel layers, e.g., a layer with the horizontal channel and filter, alayer with the vertical channels, and a layer with the inlet and outletports, which are manufactured and/or molded individually and assembledfollowing manufacture.

In specific aspects, the FTEP device can be manufactured using a circuitboard as a base, with the electrodes, filter and/or the flow channelformed in the desired configuration on the circuit board, and theremaining housing of the device containing, e.g., the one or more inletand outlet channels and/or the flow channel formed as a separate layerthat is then sealed onto the circuit board. The sealing of the top ofthe housing onto the circuit board provides the desired configuration ofthe different elements of the FTEP devices of the disclosure. Also, twoto many FTEP devices may be manufactured on a single substrate, thenseparated from one another thereafter or used in parallel. In certainembodiments, the FTEP devices are reusable and, in some embodiments, theFTEP devices are disposable. In additional embodiments, the FTEP devicesmay be autoclavable.

The electrodes 408 can be formed from any suitable metal, such ascopper, stainless steel, titanium, aluminum, brass, silver, rhodium,gold or platinum, or graphite. One preferred electrode material is alloy303 (UNS330300) austenitic stainless steel. An applied electric fieldcan destroy electrodes made from of metals like aluminum. If amultiple-use (i.e., non-disposable) flow-through FTEP device isdesired-as opposed to a disposable, one-use flow-through FTEP device—theelectrode plates can be coated with metals resistant to electrochemicalcorrosion. Conductive coatings like noble metals, e.g., gold, can beused to protect the electrode plates.

As mentioned, the FTEP devices may comprise push-pull pneumatic means toallow multi-pass electroporation procedures; that is, cells toelectroporated may be “pulled” from the inlet toward the outlet for onepass of electroporation, then be “pushed” from the outlet end of theflow-through FTEP device toward the inlet end to pass between theelectrodes again for another pass of electroporation. This process maybe repeated one to many times.

Depending on the type of cells to be electroporated (e.g., bacterial,yeast, mammalian) and the configuration of the electrodes, the distancebetween the electrodes in the flow channel can vary widely. For example,where the flow channel decreases in width, the flow channel may narrowto between 10 μm and 5 mm, or between 25 μm and 3 mm, or between 50 μmand 2 mm, or between 75 μm and 1 mm. The distance between the electrodesin the flow channel may be between 1 mm and 10 mm, or between 2 mm and 8mm, or between 3 mm and 7 mm, or between 4 mm and 6 mm. The overall sizeof the FTEP device may be from 3 cm to 15 cm in length, or 4 cm to 12 cmin length, or 4.5 cm to 10 cm in length. The overall width of the FTEPdevice may be from 0.5 cm to 5 cm, or from 0.75 cm to 3 cm, or from 1 cmto 2.5 cm, or from 1 cm to 1.5 cm.

The region of the flow channel that is narrowed is wide enough so thatat least two cells can fit in the narrowed portion side-by-side. Forexample, a typical bacterial cell is 1 μm in diameter; thus, thenarrowed portion of the flow channel of the FTEP device used totransform such bacterial cells will be at least 2 μm wide. In anotherexample, if a mammalian cell is approximately 50 μm in diameter, thenarrowed portion of the flow channel of the FTEP device used totransform such mammalian cells will be at least 100 μm wide. That is,the narrowed portion of the FTEP device will not physically contort or“squeeze” the cells being transformed.

In embodiments of the FTEP device where reservoirs are used to introducecells and exogenous material into the FTEP device, the reservoirs rangein volume from 100 μL to 10 mL, or from 500 μL to 75 mL, or from 1 mL to5 mL. The flow rate in the FTEP ranges from 0.1 mL to 5 mL per minute,or from 0.5 mL to 3 mL per minute, or from 1.0 mL to 2.5 mL per minute.The pressure in the FTEP device ranges from 1-30 psi, or from 2-10 psi,or from 3-5 psi.

To avoid different field intensities between the electrodes, theelectrodes should be arranged in parallel. Furthermore, the surface ofthe electrodes should be as smooth as possible without pin holes orpeaks. Electrodes having a roughness Rz of 1 to 10 μm are preferred. Inanother embodiment of the invention, the flow-through electroporationdevice comprises at least one additional electrode which applies aground potential to the FTEP device. Flow-through electroporationdevices (either as a stand-alone instrument or as a module in anautomated multi-module system) are described in, e.g., U.S. Ser. No.16/147,120, filed 28 Sep. 2018; Ser. No. 16/147,353, filed 28 Sep. 2018;Ser. No. 16/147,865, filed 30 Sep. 2018; and Ser. No. 16/426,310, filed30 May 2019; and U.S. Pat. No. 10,323,258, issued 18 Jun. 2019.

Cell Singulation and Enrichment Device

FIG. 6A depicts a solid wall device 6050 and a workflow for singulatingcells in microwells in the solid wall device. At the top left of thefigure (i), there is depicted solid wall device 6050 with microwells6052. A section 6054 of substrate 6050 is shown at (ii), also depictingmicrowells 6052. At (iii), a side cross-section of solid wall device6050 is shown, and microwells 6052 have been loaded, where, in thisembodiment, Poisson or substantial Poisson loading has taken place; thatis, each microwell has one or no cells, and the likelihood that any onemicrowell has more than one cell is low. At (iv), workflow 6040 isillustrated where substrate 6050 having microwells 6052 shows microwells6056 with one cell per microwell, microwells 6057 with no cells in themicrowells, and one microwell 6060 with two cells in the microwell. Instep 6051, the cells in the microwells are allowed to doubleapproximately 2-150 times to form clonal colonies (v), then editing isallowed to occur 6053.

After editing 6053, many cells in the colonies of cells that have beenedited die as a result of the double-strand cuts caused by activeediting and there is a lag in growth for the edited cells that dosurvive but must repair and recover following editing (microwells 6058),where cells that do not undergo editing thrive (microwells 6059) (vi).All cells are allowed to continue grow to establish colonies andnormalize, where the colonies of edited cells in microwells 6058 catchup in size and/or cell number with the cells in microwells 6059 that donot undergo editing (vii). Once the cell colonies are normalized, eitherpooling 6060 of all cells in the microwells can take place, in whichcase the cells are enriched for edited cells by eliminating the biasfrom non-editing cells and fitness effects from editing; alternatively,colony growth in the microwells is monitored after editing, and slowgrowing colonies (e.g., the cells in microwells 6058) are identified andselected 6061 (e.g., “cherry picked”) resulting in even greaterenrichment of edited cells.

In growing the cells, the medium used will depend, of course, on thetype of cells being edited—e.g., bacterial, yeast or mammalian. Forexample, medium for yeast cell growth includes LB, SOC, TPD, YPG, YPAD,MEM and DMEM.

FIG. 6B depicts a solid wall device 6050 and a workflow forsubstantially singulating cells in microwells in a solid wall device. Atthe top left of the figure (i), there is depicted solid wall device 350with microwells 6052. A section 6054 of substrate 6050 is shown at (ii),also depicting microwells 6052. At (iii), a side cross-section of solidwall device 6050 is shown, and microwells 6052 have been loaded, where,in this embodiment, substantial Poisson loading has taken place; thatis, some microwells 6057 have no cells, and some microwells 6076, 6078have a few cells. In FIG. 6B, cells with active gRNAs are shown as solidcircles, and cells with inactive gRNAs are shown as open circles. At(iv), workflow 6070 is illustrated where substrate 6050 havingmicrowells 6052 shows three microwells 6076 with several cells all withactive gRNAs, microwell 6057 with no cells, and two microwells 6078 withsome cells having active gRNAs and some cells having inactive gRNAs. Instep 6071, the cells in the microwells are allowed to doubleapproximately 2-150 times to form clonal colonies (v), then editingtakes place 6073.

After editing 6073, many cells in the colonies of cells that have beenedited die as a result of the double-strand cuts caused by activeediting and there is a lag in growth for the edited cells that dosurvive but must repair and recover following editing (microwells 6076),where cells that do not undergo editing thrive (microwells 6078) (vi).Thus, in microwells 6076 where only cells with active gRNAs reside(cells depicted by solid circles), most cells die off; however, inmicrowells 6078 containing cells with inactive gRNAs (cells depicted byopen circles), cells continue to grow and are not impacted by activeediting. The cells in each microwell (6076 and 6078) are allowed to growto continue to establish colonies and normalize, where the colonies ofedited cells in microwells 6076 catch up in size and/or cell number withthe unedited cells in microwells 6078 that do not undergo editing (vii).Note that in this workflow 6070, the colonies of cells in the microwellsare not clonal; that is, not all cells in a well arise from a singlecell. Instead, the cell colonies in the well may be mixed colonies,arising in many wells from two to several different cells. Once the cellcolonies are normalized, either pooling 6090 of all cells in themicrowells can take place, in which case the cells are enriched foredited cells by eliminating the bias from non-editing cells and fitnesseffects from editing; alternatively, colony growth in the microwells ismonitored after editing, and slow growing colonies (e.g., the cells inmicrowells 6076) are identified and selected 6091 (e.g., “cherrypicked”) resulting in even greater enrichment of edited cells.

A module useful for performing the methods depicted in FIGS. 6A and 6Bis a solid wall isolation, incubation, and normalization (SWIIN) module.FIG. 6C depicts an embodiment of a SWIIN module 650 from an exploded topperspective view. In SWIIN module 650 the retentate member is formed onthe bottom of a top of a SWIIN module component and the permeate memberis formed on the top of the bottom of a SWIIN module component.

The SWIIN module 650 in FIG. 6C comprises from the top down, a reservoirgasket or cover 658, a retentate member 604 (where a retentate flowchannel cannot be seen in this FIG. 6C), a perforated member 601 swagedwith a filter (filter not seen in FIG. 6C), a permeate member 608comprising integrated reservoirs (permeate reservoirs 652 and retentatereservoirs 654), and two reservoir seals 662, which seal the bottom ofpermeate reservoirs 652 and retentate reservoirs 654. A permeate channel660 a can be seen disposed on the top of permeate member 608, defined bya raised portion 676 of serpentine channel 660 a, and ultrasonic tabs664 can be seen disposed on the top of permeate member 608 as well. Theperforations that form the wells on perforated member 601 are not seenin this FIG. 6C; however, through-holes 666 to accommodate theultrasonic tabs 664 are seen. In addition, supports 670 are disposed ateither end of SWIIN module 650 to support SWIIN module 650 and toelevate permeate member 608 and retentate member 604 above reservoirs652 and 654 to minimize bubbles or air entering the fluid path from thepermeate reservoir to serpentine channel 660 a or the fluid path fromthe retentate reservoir to serpentine channel 660 b (neither fluid pathis seen in this FIG. 6C).

In this FIG. 6C, it can be seen that the serpentine channel 660 a thatis disposed on the top of permeate member 608 traverses permeate member608 for most of the length of permeate member 608 except for the portionof permeate member 608 that comprises permeate reservoirs 652 andretentate reservoirs 654 and for most of the width of permeate member608. As used herein with respect to the distribution channels in theretentate member or permeate member, “most of the length” means about95% of the length of the retentate member or permeate member, or about90%, 85%, 80%, 75%, or 70% of the length of the retentate member orpermeate member. As used herein with respect to the distributionchannels in the retentate member or permeate member, “most of the width”means about 95% of the width of the retentate member or permeate member,or about 90%, 85%, 80%, 75%, or 70% of the width of the retentate memberor permeate member.

In this embodiment of a SWIIN module, the perforated member includesthrough-holes to accommodate ultrasonic tabs disposed on the permeatemember. Thus, in this embodiment the perforated member is fabricatedfrom 316 stainless steel, and the perforations form the walls ofmicrowells while a filter or membrane is used to form the bottom of themicrowells. Typically, the perforations (microwells) are approximately150 μm-200 μm in diameter, and the perforated member is approximately125 μm deep, resulting in microwells having a volume of approximately2.5 nl, with a total of approximately 200,000 microwells. The distancebetween the microwells is approximately 279 μm center-to-center. Thoughhere the microwells have a volume of approximately 2.5 nl, the volume ofthe microwells may be from 1 to 25 nl, or preferably from 2 to 10 nl,and even more preferably from 2 to 4 nl. As for the filter or membrane,like the filter described previously, filters appropriate for use aresolvent resistant, contamination free during filtration, and are able toretain the types and sizes of cells of interest. For example, in orderto retain small cell types such as bacterial cells, pore sizes can be aslow as 0.10 μm, however for other cell types (e.g., such as formammalian cells), the pore sizes can be as high as 10.0 μm-20.0 μm ormore. Indeed, the pore sizes useful in the cell concentrationdevice/module include filters with sizes from 0.10 μm, 0.11 μm, 0.12 μm,0.13 μm, 0.14 μm, 0.15 μm, 0.16 μm, 0.17 μm, 0.18 μm, 0.19 μm, 0.20 μm,0.21 μm, 0.22 μm, 0.23 μm, 0.24 μm, 0.25 μm, 0.26 μm, 0.27 μm, 0.28 μm,0.29 μm, 0.30 μm, 0.31 μm, 0.32 μm, 0.33 μm, 0.34 μm, 0.35 μm, 0.36 μm,0.37 μm, 0.38 μm, 0.39 μm, 0.40 μm, 0.41 μm, 0.42 μm, 0.43 μm, 0.44 μm,0.45 μm, 0.46 μm, 0.47 μm, 0.48 μm, 0.49 μm, 0.50 μm and larger. Thefilters may be fabricated from any suitable material including cellulosemixed ester (cellulose nitrate and acetate) (CME), polycarbonate (PC),polyvinylidene fluoride (PVDF), polyethersulfone (PES),polytetrafluoroethylene (PTFE), nylon, or glass fiber.

The cross-section configuration of the mated serpentine channel may beround, elliptical, oval, square, rectangular, trapezoidal, or irregular.If square, rectangular, or another shape with generally straight sides,the cross section may be from about 2 mm to 15 mm wide, or from 3 mm to12 mm wide, or from 5 mm to 10 mm wide. If the cross section of themated serpentine channel is generally round, oval or elliptical, theradius of the channel may be from about 3 mm to 20 mm in hydraulicradius, or from 5 mm to 15 mm in hydraulic radius, or from 8 mm to 12 mmin hydraulic radius.

Serpentine channels 660 a and 660 b can have approximately the samevolume or a different volume. For example, each “side” or portion 660 a,660 b of the serpentine channel may have a volume of, e.g., 2 mL, orserpentine channel 660 a of permeate member 608 may have a volume of 2mL, and the serpentine channel 660 b of retentate member 604 may have avolume of, e.g., 3 mL. The volume of fluid in the serpentine channel mayrange from about 2 mL to about 80 mL, or about 4 mL to 60 mL, or from 5mL to 40 mL, or from 6 mL to 20 mL (note these volumes apply to a SWIINmodule comprising a, e.g., 50-500K perforation member). The volume ofthe reservoirs may range from 5 mL to 50 mL, or from 7 mL to 40 mL, orfrom 8 mL to 30 mL or from 10 mL to 20 mL, and the volumes of allreservoirs may be the same or the volumes of the reservoirs may differ(e.g., the volume of the permeate reservoirs is greater than that of theretentate reservoirs).

The serpentine channel portions 660 a and 660 b of the permeate member608 and retentate member 604, respectively, are approximately 200 mmlong, 130 mm wide, and 4 mm thick, though in other embodiments, theretentate and permeate members can be from 75 mm to 400 mm in length, orfrom 100 mm to 300 mm in length, or from 150 mm to 250 mm in length;from 50 mm to 250 mm in width, or from 75 mm to 200 mm in width, or from100 mm to 150 mm in width; and from 2 mm to 15 mm in thickness, or from4 mm to 10 mm in thickness, or from 5 mm to 8 mm in thickness.Embodiments the retentate (and permeate) members may be fabricated fromPMMA (poly(methyl methacrylate) or other materials may be used,including polycarbonate, cyclic olefin co-polymer (COC), glass,polyvinyl chloride, polyethylene, polyamide, polypropylene, polysulfone,polyurethane, and co-polymers of these and other polymers. Preferably atleast the retentate member is fabricated from a transparent material sothat the cells can be visualized (see, e.g., FIG. 6F and the descriptionthereof). For example, a video camera may be used to monitor cell growthby, e.g., density change measurements based on an image of an emptywell, with phase contrast, or if, e.g., a chromogenic marker, such as achromogenic protein, is used to add a distinguishable color to thecells. Chromogenic markers such as blitzen blue, dreidel teal, virginiaviolet, vixen purple, prancer purple, tinsel purple, maccabee purple,donner magenta, cupid pink, seraphina pink, scrooge orange, and leororange (the Chromogenic Protein Paintbox, all available from ATUM(Newark, Calif.)) obviate the need to use fluorescence, althoughfluorescent cell markers, fluorescent proteins, and chemiluminescentcell markers may also be used.

Because the retentate member preferably is transparent, colony growth inthe SWIIN module can be monitored by automated devices such as thosesold by JoVE (ScanLag™ system, Cambridge, Mass.) (also seeLevin-Reisman, et al., Nature Methods, 7:737-39 (2010)). Cell growthfor, e.g., mammalian cells may be monitored by, e.g., the growth monitorsold by IncuCyte (Ann Arbor, Mich.) (see also, Choudhry, PLos One,11(2):e0148469 (2016)). Further, automated colony pickers may beemployed, such as those sold by, e.g., TECAN (Pickolo™ system,Mannedorf, Switzerland); Hudson Inc. (RapidPick™, Springfield, N.J.);Molecular Devices (QPix 400™ system, San Jose, Calif.); and SingerInstruments (PIXL™ system, Somerset, UK).

Due to the heating and cooling of the SWIIN module, condensation mayaccumulate on the retentate member which may interfere with accuratevisualization of the growing cell colonies. Condensation of the SWIINmodule 650 may be controlled by, e.g., moving heated air over the top of(e.g., retentate member) of the SWIIN module 650, or by applying atransparent heated lid over at least the serpentine channel portion 660b of the retentate member 604. See, e.g., FIG. 6F and the descriptionthereof infra.

In SWIIN module 650 cells and medium—at a dilution appropriate forPoisson or substantial Poisson distribution of the cells in themicrowells of the perforated member—are flowed into serpentine channel660 b from ports in retentate member 604, and the cells settle in themicrowells while the medium passes through the filter into serpentinechannel 660 a in permeate member 608. The cells are retained in themicrowells of perforated member 601 as the cells cannot travel throughfilter 603. Appropriate medium may be introduced into permeate member608 through permeate ports 611. The medium flows upward through filter603 to nourish the cells in the microwells (perforations) of perforatedmember 601. Additionally, buffer exchange can be effected by cyclingmedium through the retentate and permeate members. In operation, thecells are deposited into the microwells, are grown for an initial, e.g.,2-100 doublings, editing is induced by, e.g., raising the temperature ofthe SWIIN to 42° C. to induce a temperature inducible promoter or byremoving growth medium from the permeate member and replacing the growthmedium with a medium comprising a chemical component that induces aninducible promoter.

Once editing has taken place, the temperature of the SWIIN may bedecreased, or the inducing medium may be removed and replaced with freshmedium lacking the chemical component thereby de-activating theinducible promoter. The cells then continue to grow in the SWIIN module650 until the growth of the cell colonies in the microwells isnormalized. For the normalization protocol, once the colonies arenormalized, the colonies are flushed from the microwells by applyingfluid or air pressure (or both) to the permeate member serpentinechannel 660 a and thus to filter 603 and pooled. Alternatively, ifcherry picking is desired, the growth of the cell colonies in themicrowells is monitored, and slow-growing colonies are directlyselected; or, fast-growing colonies are eliminated.

FIG. 6D is a top perspective view of a SWIIN module with the retentateand perforated members in partial cross section. In this FIG. 6D, it canbe seen that serpentine channel 660 a is disposed on the top of permeatemember 608 is defined by raised portions 676 and traverses permeatemember 608 for most of the length and width of permeate member 608except for the portion of permeate member 608 that comprises thepermeate and retentate reservoirs (note only one retentate reservoir 652can be seen). Moving from left to right, reservoir gasket 658 isdisposed upon the integrated reservoir cover 678 (cover not seen in thisFIG. 6D) of retentate member 604. Gasket 658 comprises reservoir accessapertures 632 a, 632 b, 632 c, and 632 d, as well as pneumatic ports 633a, 633 b, 633 c and 633 d. Also at the far left end is support 670.Disposed under permeate reservoir 652 can be seen one of two reservoirseals 662. In addition to the retentate member being in cross section,the perforated member 601 and filter 603 (filter 603 is not seen in thisFIG. 6D) are in cross section. Note that there are a number ofultrasonic tabs 664 disposed at the right end of SWIIN module 650 and onraised portion 676 which defines the channel turns of serpentine channel660 a, including ultrasonic tabs 664 extending through through-holes 666of perforated member 601. There is also a support 670 at the end distalreservoirs 652, 654 of permeate member 608.

FIG. 6E is a side perspective view of an assembled SWIIIN module 650,including, from right to left, reservoir gasket 658 disposed uponintegrated reservoir cover 678 (not seen) of retentate member 604.Gasket 658 may be fabricated from rubber, silicone, nitrile rubber,polytetrafluoroethylene, a plastic polymer such aspolychlorotrifluoroethylene, or other flexible, compressible material.Gasket 658 comprises reservoir access apertures 632 a, 632 b, 632 c, and632 d, as well as pneumatic ports 633 a, 633 b, 633 c and 633 d. Also atthe far-left end is support 670 of permeate member 608. In addition,permeate reservoir 652 can be seen, as well as one reservoir seal 662.At the far-right end is a second support 670.

Imaging of cell colonies growing in the wells of the SWIIN is desired inmost implementations for, e.g., monitoring both cell growth and deviceperformance and imaging is necessary for cherry-picking implementations.Real-time monitoring of cell growth in the SWIIN requires backlighting,retentate plate (top plate) condensation management and a system-levelapproach to temperature control, air flow, and thermal management. Insome implementations, imaging employs a camera or CCD device withsufficient resolution to be able to image individual wells. For example,in some configurations a camera with a 9-pixel pitch is used (that is,there are 9 pixels center-to-center for each well). Processing theimages may, in some implementations, utilize reading the images ingrayscale, rating each pixel from low to high, where wells with no cellswill be brightest (due to full or nearly-full light transmission fromthe backlight) and wells with cells will be dim (due to cells blockinglight transmission from the backlight). After processing the images,thresholding is performed to determine which pixels will be called“bright” or “dim”, spot finding is performed to find bright pixels andarrange them into blocks, and then the spots are arranged on a hexagonalgrid of pixels that correspond to the spots. Once arranged, the measureof intensity of each well is extracted, by, e.g., looking at one or morepixels in the middle of the spot, looking at several to many pixels atrandom or pre-set positions, or averaging X number of pixels in thespot. In addition, background intensity may be subtracted. Thresholdingis again used to call each well positive (e.g., containing cells) ornegative (e.g., no cells in the well). The imaging information may beused in several ways, including taking images at time points formonitoring cell growth. Monitoring cell growth can be used to, e.g.,remove the “muffin tops” of fast-growing cells followed by removal ofall cells or removal of cells in “rounds” as described above, or recovercells from specific wells (e.g., slow-growing cell colonies);alternatively, wells containing fast-growing cells can be identified andareas of UV light covering the fast-growing cell colonies can beprojected (or rastered with shutters) onto the SWIIN to irradiate orinhibit growth of those cells. Imaging may also be used to assure properfluid flow in the serpentine channel 660.

FIG. 6F depicts the embodiment of the SWIIN module in FIGS. 6A-6Efurther comprising a heat management system including a heater and aheated cover. The heater cover facilitates the condensation managementthat is required for imaging. Assembly 698 comprises a SWIIN module 650seen lengthwise in cross section, where one permeate reservoir 652 isseen. Disposed immediately upon SWIIN module 650 is cover 694 anddisposed immediately below SWIIN module 650 is backlight 680, whichallows for imaging. Beneath and adjacent to the backlight and SWIINmodule is insulation 682, which is disposed over a heatsink 684. In thisFIG. 6F, the fins of the heatsink would be in-out of the page. Inaddition there is also axial fan 686 and heat sink 688, as well as twothermoelectric coolers 692, and a controller 690 to control thepneumatics, thermoelectric coolers, fan, solenoid valves, etc. Thearrows denote cool air coming into the unit and hot air being removedfrom the unit. It should be noted that control of heating allows forgrowth of many different types of cells (prokaryotic and eukaryotic) aswell as strains of cells that are, e.g., temperature sensitive, etc.,and allows use of temperature-sensitive promoters. Temperature controlallows for protocols to be adjusted to account for differences intransformation efficiency, cell growth and viability. For more detailsregarding solid wall isolation incubation and normalization devices seeU.S. Ser. No. 16/399,988, filed 30 Apr. 2019; Ser. No. 16/454,865, filed26 Jun. 2019; and Ser. No. 16/540,606, filed 14 Aug. 2019. Foralternative isolation, incubation and normalization modules, see U.S.Ser. No. 16/536,049, filed 8 Aug. 2019.

Cell Selection Module

The split protein reporter system described herein provides fluorescentor bioluminescent cells as a read out for properly-edited cells. Theproperly-edited cells can be sorted from non-edited or improperly-editedcells via fluorescence-activated cell sorting (FACS). FACs is aderivative of flow cytometry that adds an enhanced degree offunctionality. Using FACs, a heterogenous mixture of live cells can besorted into different populations. FACs is the only availablepurification technique to isolate cells based on internal staining orintracellular protein expression, and allows for the purification ofindividual cells based on size, granularity and fluorescence. Cells insuspension are passed as a stream in droplets with each dropletcontaining a single cell of interest. The droplets are passed in frontof a laser. An optical detection system detects cells of interest basedon predetermined optical parameters (e.g., fluorescent or bioluminescentparameters). The instrument applies a charge to a droplet containing acell of interest and an electrostatic deflection system facilitatescollection of the charged droplets into appropriate tubes or wells.Sorting parameters may be adjusted depending on the requirement ofpurity and yield. Using the split protein reporter system,properly-edited cells are bioluminescent and improperly- or un-editedcells are not bioluminescent; thus, the desired cells are easily sortedfrom unwanted cells.

Use of the Automated Multi-Module Cell Processing Instrument

One embodiment of an automated multi-module cell processing instrumentcapable of performing the methods described herein is shown in FIG. 7.The cell processing instrument 700 may include a housing, a reservoir ofcells in, e.g., the reagent cartridge where the cells are to betransformed 704. The cells are transferred from the reservoir to thecell growth and concentration module 708. In this embodiment, the cellgrowth and concentration module is a single module, such as a TFF;however, in other embodiments the cell growth and concentration modulesmay be separate, such as a cell growth module comprising a rotatinggrowth module and a cell concentration device comprising a TFF. Thecells to be processed are transferred from, e.g., a reservoir in thereagent cartridge to the cell growth module 708 to be cultured until thecells hit a target OD. Once the cells hit the target OD, the cell growthmodule may cool the cells for later processing or the cells may progressdirectly to cell concentration, where buffer or medium exchange isperformed, the cells are rendered competent, and the volume of the cellsis reduced to a volume optimal for cell transformation in atransformation module 710. The transformation module 710 may be, e.g., aflow-through electroporation device.

In addition to the reservoir for storing the cells, the reagentcartridge may include a reservoir for storing editing vectors 706comprising editing cassettes and a reservoir for storing an enginevector 702 comprising, e.g., a coding sequence for a nuclease and codingsequences for the Cascade-C-terminal T7 RNAP fusion construct and thedCas3-N-terminal T7 RNAP fusion construct. As described above inrelation to FIG. 1A, the Cascade-C-terminal T7 RNAP fusion construct andthe dCas3-N-terminal T7 RNAP fusion construct may located on the enginevector (e.g., the vector comprising the coding sequence for thenuclease), the Cascade-C-terminal T7 RNAP fusion construct and thedCas3-N-terminal T7 RNAP fusion construct may both be located on asingle reporter vector along with the reporter gene under the control ofa T7 promoter, or the various components of the split protein system maybe on different reporter vectors, on the editing vector, and/or on theengine vector. The editing vector, engine vector and reporter vectors(if separate, not shown) are then transferred to the transformationmodule 710 to be electroporated into the cells.

Once the cells have been transformed, the cells may be transferred to anediting module 712, such as a SWIIN module as described above, forediting. In addition, selection may be performed in a separate modulebetween the transformation module and the editing module, or selectionmay be performed in the editing module. Selection in this instancerefers to selecting for cells that have been properly transformed withvectors that comprise selection markers, thus assuring that the cellshave received all vectors for both nucleic acid-guided nuclease editingand for reporting proper edits. After selection, conditions are providedfor editing. If any components of the nucleic acid-guided nucleaseediting system are under the control of an inducible promoter,conditions are provided to activate the inducible promoters for editing.While the cells are editing, the split protein reporter system may beactive, wherein cells that have been properly edited are emitting light;alternatively, one or both of the fusion constructs (e.g., theCascade-C-terminal T7 RNAP fusion construct or the dCas3-N-terminal T7RNAP fusion construct) and/or the edit-discriminating gRNA may be underthe control of an inducible promoter, and the split protein reportersystem is not activated until the cells have been edited. Whether duringor after editing, the split protein reporter system when active allowsfor identification of cells that have been properly edited viabioluminescence. Following editing, the cells are transferred to aselection module 714, where the cells can be sorted.

Cells in which the split protein reporter system is active (e.g.,luminescent cells) and have been separated from cells that are notluminescent can then be grown and prepared for another round of editing.The multi-module cell processing instrument is controlled by a processor716 configured to operate the instrument based on user input, asdirected by one or more scripts, or as a combination of user input or ascript. The processor 716 may control the timing, duration, temperature,and operations of the various modules of the instrument 700 and thedispensing of reagents from the reagent cartridge. The processor may beprogrammed with standard protocol parameters from which a user mayselect, a user may specify one or more parameters manually or one ormore scripts associated with the reagent cartridge may specify one ormore operations and/or reaction parameters. In addition, the processormay notify the user (e.g., via an application to a smart phone or otherdevice) that the cells have reached a target OD, been rendered competentand concentrated, and/or update the user as to the progress of the cellsin the various modules in the multi-module instrument.

For examples of multi-module cell editing instruments, see U.S. Pat. No.10,253,316, issued 9 Apr. 2019; U.S. Pat. No. 10,329,559, issued 25 Jun.2019; U.S. Pat. No. 10,323,242, issued 18 Jun. 2019; U.S. Pat. No.10,421,959, issued 24 Sep. 2019; U.S. Pat. No. 10,465,185, issued 5 Nov.2019; and U.S. Ser. No. 16/412,195, filed 14 May 2019; Ser. No.16/571,091, filed 14 Sep. 2019; and Ser. No. 16/666,964, filed 29 Oct.2019, all of which are herein incorporated by reference in theirentirety.

It should be apparent to one of ordinary skill in the art given thepresent disclosure that the process described may be recursive andmultiplexed; that is, cells may go through the workflow described inrelation to FIG. 7, then the resulting edited culture may go throughanother (or several or many) rounds of additional editing (e.g.,recursive editing) with different editing vectors. For example, thecells from round 1 of editing may be diluted and an aliquot of theedited cells edited by editing vector A may be combined with editingvector B, an aliquot of the edited cells edited by editing vector A maybe combined with editing vector C, an aliquot of the edited cells editedby editing vector A may be combined with editing vector D, and so on fora second round of editing. After round two, an aliquot of each of thedouble-edited cells may be subjected to a third round of editing, where,e.g., aliquots of each of the AB-, AC-, AD-edited cells are combinedwith additional editing vectors, such as editing vectors X, Y, and Z.That is that double-edited cells AB may be combined with and edited byvectors X, Y, and Z to produce triple-edited edited cells ABX, ABY, andABZ; double-edited cells AC may be combined with and edited by vectorsX, Y, and Z to produce triple-edited cells ACX, ACY, and ACZ; anddouble-edited cells AD may be combined with and edited by vectors X, Y,and Z to produce triple-edited cells ADX, ADY, and ADZ, and so on. Inthis process, many permutations and combinations of edits can beexecuted, leading to very diverse cell populations and cell libraries.

In any recursive process, it is advantageous to “cure” the previousengine and editing vectors (or single engine+editing vector in a singlevector system). “Curing” is a process in which one or more vectors usedin the prior round of editing is eliminated from the transformed cells.Curing can be accomplished by, e.g., cleaving the vector(s) using acuring plasmid thereby rendering the editing and/or engine vector (orsingle, combined engine/editing vector) nonfunctional; diluting thevector(s) in the cell population via cell growth (that is, the moregrowth cycles the cells go through, the fewer daughter cells will retainthe editing or engine vector(s)), or by, e.g., utilizing aheat-sensitive origin of replication on the editing or engine vector (orcombined engine+editing vector). The conditions for curing will dependon the mechanism used for curing; that is, in this example, how thecuring plasmid cleaves the editing and/or engine vector.

EXAMPLES

The following examples are put forth so as to provide those of ordinaryskill in the art with a complete disclosure and description of how tomake and use the present invention, and are not intended to limit thescope of what the inventors regard as their invention, nor are theyintended to represent or imply that the experiments below are all of orthe only experiments performed. It will be appreciated by personsskilled in the art that numerous variations and/or modifications may bemade to the invention as shown in the specific aspects without departingfrom the spirit or scope of the invention as broadly described. Thepresent aspects are, therefore, to be considered in all respects asillustrative and not restrictive.

Example I: Fully-Automated Singleplex RGN-Directed Editing Run

Singleplex automated genomic editing using MAD7 nuclease wassuccessfully performed with an automated multi-module instrument of thedisclosure. See U.S. Pat. No. 9,982,279; and U.S. Ser. No. 16/024,831filed 30 Jun. 2018; Ser. No. 16/024,816 filed 30 Jun. 2018; Ser. No.16/147,353 filed 28 Sep. 2018; Ser. No. 16/147,865 filed 30 Sep. 2018;and Ser. No. 16/147,871 filed 30 Jun. 2018.

An ampR plasmid backbone and a lacZ_F172* editing cassette wereassembled via Gibson Assembly® into an “editing vector” in an isothermalnucleic acid assembly module included in the automated instrument.lacZ_F172 functionally knocks out the lacZ gene. “lacZ_F172*” indicatesthat the edit happens at the 172nd residue in the lacZ amino acidsequence. Following assembly, the product was de-salted in theisothermal nucleic acid assembly module using AMPure beads, washed with80% ethanol, and eluted in buffer. The assembled editing vector andrecombineering-ready, electrocompetent E. Coli cells were transferredinto a transformation module for electroporation. The cells and nucleicacids were combined and allowed to mix for 1 minute, and electroporationwas performed for 30 seconds. The parameters for the poring pulse were:voltage, 2400 V; length, 5 ms; interval, 50 ms; number of pulses, 1;polarity, +. The parameters for the transfer pulses were: Voltage, 150V; length, 50 ms; interval, 50 ms; number of pulses, 20; polarity, +/−.Following electroporation, the cells were transferred to a recoverymodule (another growth module), and allowed to recover in SOC mediumcontaining chloramphenicol. Carbenicillin was added to the medium after1 hour, and the cells were allowed to recover for another 2 hours. Afterrecovery, the cells were held at 4° C. until recovered by the user.

After the automated process and recovery, an aliquot of cells was platedon MacConkey agar base supplemented with lactose (as the sugarsubstrate), chloramphenicol and carbenicillin and grown until coloniesappeared. White colonies represented functionally edited cells, purplecolonies represented un-edited cells. All liquid transfers wereperformed by the automated liquid handling device of the automatedmulti-module cell processing instrument.

The result of the automated processing was that approximately 1.0E⁰³total cells were transformed (comparable to conventional benchtopresults), and the editing efficiency was 83.5%. The lacZ_172 edit in thewhite colonies was confirmed by sequencing of the edited region of thegenome of the cells. Further, steps of the automated cell processingwere observed remotely by webcam and text messages were sent to updatethe status of the automated processing procedure.

Example II: Fully-Automated Recursive Editing Run

Recursive editing was successfully achieved using the automatedmulti-module cell processing system. An ampR plasmid backbone and alacZ_V10* editing cassette were assembled via Gibson Assembly® into an“editing vector” in an isothermal nucleic acid assembly module includedin the automated system. Similar to the lacZ_F172 edit, the lacZ_V10edit functionally knocks out the lacZ gene. “lacZ_V10” indicates thatthe edit happens at amino acid position 10 in the lacZ amino acidsequence. Following assembly, the product was de-salted in theisothermal nucleic acid assembly module using AMPure beads, washed with80% ethanol, and eluted in buffer. The first assembled editing vectorand the recombineering-ready electrocompetent E. Coli cells weretransferred into a transformation module for electroporation. The cellsand nucleic acids were combined and allowed to mix for 1 minute, andelectroporation was performed for 30 seconds. The parameters for theporing pulse were: voltage, 2400 V; length, 5 ms; interval, 50 ms;number of pulses, 1; polarity, +. The parameters for the transfer pulseswere: Voltage, 150 V; length, 50 ms; interval, 50 ms; number of pulses,20; polarity, +/−. Following electroporation, the cells were transferredto a recovery module (another growth module) allowed to recover in SOCmedium containing chloramphenicol. Carbenicillin was added to the mediumafter 1 hour, and the cells were grown for another 2 hours. The cellswere then transferred to a centrifuge module and a media exchange wasthen performed. Cells were resuspended in TB containing chloramphenicoland carbenicillin where the cells were grown to OD600 of 2.7, thenconcentrated and rendered electrocompetent.

During cell growth, a second editing vector was prepared in anisothermal nucleic acid assembly module. The second editing vectorcomprised a kanamycin resistance gene, and the editing cassettecomprised a galK Y145* edit. If successful, the galK Y145* edit conferson the cells the ability to uptake and metabolize galactose. The editgenerated by the galK Y154* cassette introduces a stop codon at the154th amino acid reside, changing the tyrosine amino acid to a stopcodon. This edit makes the galK gene product nonfunctional and inhibitsthe cells from being able to metabolize galactose. Following assembly,the second editing vector product was de-salted in the isothermalnucleic acid assembly module using AMPure beads, washed with 80%ethanol, and eluted in buffer. The assembled second editing vector andthe electrocompetent E. Coli cells (that were transformed with andselected for the first editing vector) were transferred into atransformation module for electroporation, using the same parameters asdetailed above. Following electroporation, the cells were transferred toa recovery module (another growth module), allowed to recover in SOCmedium containing carbenicillin. After recovery, the cells were held at4° C. until retrieved, after which an aliquot of cells were plated on LBagar supplemented with chloramphenicol, and kanamycin. To quantify bothlacZ and galK edits, replica patch plates were generated on two mediatypes: 1) MacConkey agar base supplemented with lactose (as the sugarsubstrate), chloramphenicol, and kanamycin, and 2) MacConkey agar basesupplemented with galactose (as the sugar substrate), chloramphenicol,and kanamycin. All liquid transfers were performed by the automatedliquid handling device of the automated multi-module cell processingsystem.

In this recursive editing experiment, 41% of the colonies screened hadboth the lacZ and galK edits, the results of which were comparable tothe double editing efficiencies obtained using a “benchtop” or manualapproach.

Example III: Isolation of Cells with Specified Genotypes in a Population

Following automated singleplex or recursive editing runs, a populationof cells is produced that contains multiple specific genotypescorresponding to complete intended edits, incomplete edits, and uneditedor wild type cells. To identify and isolate only the complete intendededits, a modified Type I CRISPR system is utilized in which two halvesof a split T7 RNAP (e.g., the portions of a split protein reportersystem) are fused onto the cascade complex and deactivated cas3nuclease, respectively. After discriminatory recognition of the completeintended edit via formation of an R-loop between the cascade complexwith an edit-discriminating crRNA, the deactivated cas3 fusion proteinis recruited to the site of the R-loop and binds the cascade complex.This binding event brings the two halves of the split T7 RNAP intoproximity forming an active T7 polymerase. The active T7 polymerase thentranscribes a reporter gene.

A T. fusca XY cascade complex fused to a T7 C-terminal fragment wasrecombinantly expressed and purified via a three plasmid co-expressionsystem in E. coli BL21 cells grown on LB media. The first plasmidcontained the cse1 protein of the T. fusca XY cascade complex with aC-terminal fragment (amino acids 181-883) of the T7 RNAP polymerasefused onto the C-terminal with a 20aa flexibly GlySer linker on apTAC-MAT-Tag1 (Sigma Aldrich) vector containing chloramphenicolresistance (see FIG. 8A). The second plasmid, on a pTAC-MAT-Tag1 vectorwith ampicillin resistance, contained the remaining cascade complexgenes, cse2-cas7-NLS-cas5e-cas6e, with an N23A mutation on cse2 toencourage R-loop formation at 37° C. and an SV40 NLS signal on theC-terminal of the cas7 gene, to make pTAC-cse2-cas7-NLS-cas5e-cas6e (seeFIG. 8B). The third plasmid contained the crRNA expressed from asynthetic CRISPR array to make pTAC-crRNA-GFP (FIG. 8C). E. coli BL21cells containing all three plasmids were grown to an OD of 0.6 andexpression was induced by adding IPTG to a concentration of 0.5 mMbefore letting cells grow overnight. Cells were then harvested, lysedusing lysozyme and the cascade complex with fused T7-C-terminal RNAP waspurified with Ni-NTA Agarose (Qiagen) according to manufacturer'sprotocol.

Separately, the deactivated T. fusca XY cas3 was recombinantly expressedand purified. The T. fusca XY cas3 D84A D481A with an N-terminal fusionof T7 RNAP (amino acids 1-179) followed by a 14aa flexible GlySer linkerand a C-terminal SV40 NLS and 6×His tag was cloned into pTAC-MAT-Tag1(Sigma Aldrich) to make pTAC-T7-Nterm-cas3-NLS-His (FIG. 8D). E. coliBL21 cells containing the plasmid were grown to an OD of 0.6 andexpression was induced by adding IPTG to a concentration of 1 mM beforeletting cells grow overnight at 18° C. Cells were then lysed and thedeactivated cas3-T7-Nterminal fusion was purified with Ni-NTA Agarose(Qiagen).

The pool of previously edited HEK293T-GFP cells were electroporatedusing the Neon Transfection system (ThermoFisher). Edited cells weretrypsinized, washed with 1×DPBS (ThermoFisher) and resuspended in NeonBuffer R. 80 pmol of cascade-T7-C-terminal, 20 pmol ofcas3-T7-N-terminal, and 5 pmol of F30-2×dBroccoli (pJin141) driven bythe T7 promoter were mixed with approximately 1.0E⁰⁵ cells in buffer R.The mixture was electroporated with a 10 μl Neon tip (1100V, 20 ms, 2pulses) and plated in 24-well plates. After 72 hours cells were sortedon BD FACSMelody™ cells sorter based on the highest fluorescence torecover only cells which contained the complete intended edit programmedfor by the cascade crRNA.

While this invention is satisfied by embodiments in many differentforms, as described in detail in connection with preferred embodimentsof the invention, it is understood that the present disclosure is to beconsidered as exemplary of the principles of the invention and is notintended to limit the invention to the specific embodiments illustratedand described herein. Numerous variations may be made by persons skilledin the art without departure from the spirit of the invention. The scopeof the invention will be measured by the appended claims and theirequivalents. The abstract and the title are not to be construed aslimiting the scope of the present invention, as their purpose is toenable the appropriate authorities, as well as the general public, toquickly determine the general nature of the invention. In the claimsthat follow, unless the term “means” is used, none of the features orelements recited therein should be construed as means-plus-functionlimitations pursuant to 35 U.S.C. § 112, ¶6.

We claim:
 1. A method for in vivo identification of cells properlyedited using a Type-I CRISPR nucleic acid-guided nuclease editingsystem, the method comprising the steps of: transforming the cells witha Type-I CRISPR nucleic acid-guided nuclease split protein editingsystem comprising: a nucleic acid coding for a Cascade-T7 RNA polymerase(“Cascade-T7-RNAP”) fusion protein in a vector backbone, wherein thenucleic acid coding for the Cascade-T7-RNAP fusion protein comprises anucleic acid coding for a protein from a Cascade complex fused toC-terminal amino acids of the T7-RNAP or N-terminal amino acids of theT7-RNAP; a nucleic acid coding for a dCas3-T7 RNA polymerase(“dCas3-T7-RNAP”) fusion protein in a vector backbone, wherein thenucleic acid coding for the dCas3-T7-RNAP fusion protein comprises anucleic acid coding for a dCas3 protein fused to a nucleic acid codingfor N-terminal amino acids of the T7-RNAP when the nucleic acid codingfor the Cascade-T7-RNAP fusion protein comprises a nucleic acid codingfor C-terminal amino acids of the T7-RNAP; and wherein the nucleic acidcoding for the dCas3-T7-RNAP fusion protein comprises a nucleic acidcoding for C-terminal amino acids of the T7-RNAP when the nucleic acidcoding for the Cascade-T7-RNAP fusion protein comprises a nucleic acidcoding for N-terminal amino acids of the T7-RNAP; a sequence for an editdiscriminating gRNA in a vector backbone, wherein the editdiscriminating gRNA recognizes a rationally-designed edited locus in agenome sequence but does not recognize the locus in the genome sequencein an unedited or incorrectly-edited condition; a donor DNA homologousto the genomic locus; and a coding sequence for a reporter gene underthe control of a T7 promoter; allowing the Type-I CRISPR nucleicacid-guided nuclease editing system to edit the locus in the genomesequence of the cells; providing conditions for the T7 RNAP to bind andactivate the T7 promoter thereby transcribing the reporter gene; andidentifying the cells expressing the reporter gene.
 2. The method ofclaim 1, wherein the Cascade-T7-RNAP fusion protein coding sequencecomprises the C-terminus amino acids of the T7 RNAP.
 3. The method ofclaim 2, wherein the C-terminus amino acids of the T7 RNAP comprisesamino acid residues 181-883 of T7 RNAP.
 4. The method of claim 2,wherein the C-terminus amino acids of the T7-RNAP is fused to aC-terminus of Cas5 or cse1/casA.
 5. The method of claim 1, wherein thenucleic acids coding for the dCas3-T7 RNAP fusion protein code for theN-terminus amino acids of the T7 RNAP.
 6. The method of claim 5, whereinthe N-terminus amino acids of the T7 RNAP comprises amino acid residues1-179 of T7 RNAP.
 7. The method of claim 5, wherein the N-terminus aminoacids of the T7 RNAP is fused to an N-terminus of dCas3.
 8. The methodof claim 1, wherein the reporter gene is a coding sequence for afluorescent protein.
 9. The method of claim 8, wherein the reporter geneis green fluorescent protein.
 10. The method of claim 8, wherein thereporter gene is blue fluorescent protein.
 11. The method of claim 1,wherein the reporter gene is a coding sequence for an antibioticresistance gene.
 12. The method of claim 1, wherein the reporter gene isan aptamer.
 13. The method of claim 12, wherein the reporter gene is abroccoli aptamer.
 14. The method of claim 12, wherein the reporter geneis a spinach aptamer.
 15. The method of claim 1, wherein the reportergene is a coding sequence for a cell surface marker protein.
 16. Themethod of claim 1, wherein the reporter gene is a coding sequence forluciferase.
 17. The method of claim 16, wherein the luciferase isfirefly luciferase.
 18. The method of claim 16, wherein the luciferaseis Renilla luciferase.
 19. The method of claim 1, wherein the nucleicacids coding for the Cascade-T7 RNAP fusion protein, the nucleic acidscoding for the dCas3-T7 RNAP fusion protein, the sequence for the gRNA,and coding sequence for a reporter gene under the control of a T7promoter are all on the same vector.
 20. The method of claim 1, whereinthe nucleic acids coding for the Cascade-T7 RNAP fusion protein, thenucleic acids coding for the dCas3-T7 RNAP fusion protein, the sequencefor the gRNA, and the coding sequence for a reporter gene under thecontrol of a T7 promoter are on two or more different vectors.
 21. Acell comprising the vector backbones of claim
 1. 22. The method of claim1, wherein the nucleic acids coding for the Cascade-T7-RNAP fusionprotein code for the N-terminus amino acids of the T7 RNAP.
 23. Themethod of claim 22, wherein the N-terminus amino acids of the T7-RNAP isfused to an N-terminus of Cas5 or cse1/casA.
 24. The method of claim 1,wherein the nucleic acids coding for the dCas3-T7 RNAP fusion proteincode for the C-terminus amino acids of the T7 RNAP.
 25. The method ofclaim 24, wherein the C-terminus amino acids of the T7 RNAP is fused toa C-terminus of dCas3.
 26. A method for in vivo identification of cellsproperly edited using a Type-I CRISPR nucleic acid-guided nucleaseediting system, the method comprising the steps of: transforming thecells with a Type-I CRISPR nucleic acid-guided nuclease split proteinediting system comprising: a nucleic acid coding for a Cascade-T7 RNApolymerase (“Cascade-T7-RNAP”) fusion protein in a vector backbone,wherein the nucleic acid coding for the Cascade-T7-RNAP fusion proteincomprises a nucleic acid coding for a protein from a Cascade complexfused to C-terminal amino acids of the T7-RNAP; a nucleic acid codingfor a dCas3-T7 RNA polymerase (“dCas3-T7-RNAP”) fusion protein in avector backbone, wherein the nucleic acid coding for the dCas3-T7-RNAPfusion protein comprises a nucleic acid coding for a dCas3 protein fusedto a nucleic acid coding for N-terminal amino acids of the T7-RNAP; asequence for an edit discriminating gRNA in a vector backbone, whereinthe edit discriminating gRNA recognizes a rationally-designed editedlocus in a genome sequence but does not recognize the locus in thegenome sequence in an unedited or incorrectly-edited condition; a donorDNA homologous to the genomic locus; and a coding sequence for areporter gene under the control of a T7 promoter; allowing the Type-ICRISPR nucleic acid-guided nuclease editing system to edit the locus inthe genome sequence of the cells; providing conditions for the T7 RNAPto bind and activate the T7 promoter thereby transcribing the reportergene; and identifying the cells expressing the reporter gene.
 27. Themethod of claim 22, wherein the C-terminus amino acids of the T7 RNAPcomprises amino acid residues 181-883 of T7 RNAP.
 28. The method ofclaim 22, wherein the N-terminus amino acids of the T7 RNAP comprisesamino acid residues 1-179 of T7 RNAP.
 29. A method for in vivoidentification of cells properly edited using a Type-I CRISPR nucleicacid-guided nuclease editing system, the method comprising the steps of:transforming the cells with a Type-I CRISPR nucleic acid-guided nucleasesplit protein editing system comprising: a nucleic acid coding for aCascade-T7 RNA polymerase (“Cascade-T7-RNAP”) fusion protein in a vectorbackbone, wherein the nucleic acid coding for the Cascade-T7-RNAP fusionprotein comprises a nucleic acid coding for a protein from a Cascadecomplex fused to N-terminal amino acids of the T7-RNAP; a nucleic acidcoding for a dCas3-T7 RNA polymerase (“dCas3-T7-RNAP”) fusion protein ina vector backbone, wherein the nucleic acid coding for the dCas3-T7-RNAPfusion protein comprises a nucleic acid coding for a dCas3 protein fusedto a nucleic acid coding for C-terminal amino acids of the T7-RNAP; asequence for an edit discriminating gRNA in a vector backbone, whereinthe edit discriminating gRNA recognizes a rationally-designed editedlocus in a genome sequence but does not recognize the locus in thegenome sequence in an unedited or incorrectly-edited condition; a donorDNA homologous to the genomic locus; and a coding sequence for areporter gene under the control of a T7 promoter; allowing the Type-ICRISPR nucleic acid-guided nuclease editing system to edit the locus inthe genome sequence of the cells; providing conditions for the T7 RNAPto bind and activate the T7 promoter thereby transcribing the reportergene; and identifying the cells expressing the reporter gene.
 30. Themethod of claim 22, wherein the C-terminus amino acids of the T7 RNAPcomprises amino acid residues 181-883 of T7 RNAP and the N-terminusamino acids of the T7 RNAP comprises amino acid residues 1-179 of T7RNAP.